A grammar based approach for mining bioinformatics databases

Daniel Quest, Hesham H. Ali

Research output: Contribution to journalConference article

Abstract

In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular grammars in allowing the use of advanced queries in comparing sequences and searching for motifs or interior-sequence attributes in biological databases. The formal grammar and the corresponding data mining engine is capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on similarity scheme designed by the user. This model is based on the objective (ontology) of the user and scoring is dynamic and is provided at runtime.

Original languageEnglish (US)
Number of pages1
JournalProceedings of the Annual Hawaii International Conference on System Sciences
StatePublished - Nov 10 2005
Event38th Annual Hawaii International Conference on System Sciences - Big Island, HI, United States
Duration: Jan 3 2005Jan 6 2005

Fingerprint

Bioinformatics
Data mining
Sorting
Ontology
Engines

ASJC Scopus subject areas

  • Engineering(all)

Cite this

@article{ec0d02ae383d44fd906f4488bcbacb75,
title = "A grammar based approach for mining bioinformatics databases",
abstract = "In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular grammars in allowing the use of advanced queries in comparing sequences and searching for motifs or interior-sequence attributes in biological databases. The formal grammar and the corresponding data mining engine is capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on similarity scheme designed by the user. This model is based on the objective (ontology) of the user and scoring is dynamic and is provided at runtime.",
author = "Daniel Quest and Ali, {Hesham H.}",
year = "2005",
month = "11",
day = "10",
language = "English (US)",
journal = "Proceedings of the Annual Hawaii International Conference on System Sciences",
issn = "1530-1605",

}

TY - JOUR

T1 - A grammar based approach for mining bioinformatics databases

AU - Quest, Daniel

AU - Ali, Hesham H.

PY - 2005/11/10

Y1 - 2005/11/10

N2 - In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular grammars in allowing the use of advanced queries in comparing sequences and searching for motifs or interior-sequence attributes in biological databases. The formal grammar and the corresponding data mining engine is capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on similarity scheme designed by the user. This model is based on the objective (ontology) of the user and scoring is dynamic and is provided at runtime.

AB - In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular grammars in allowing the use of advanced queries in comparing sequences and searching for motifs or interior-sequence attributes in biological databases. The formal grammar and the corresponding data mining engine is capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on similarity scheme designed by the user. This model is based on the objective (ontology) of the user and scoring is dynamic and is provided at runtime.

UR - http://www.scopus.com/inward/record.url?scp=27544504042&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=27544504042&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:27544504042

JO - Proceedings of the Annual Hawaii International Conference on System Sciences

JF - Proceedings of the Annual Hawaii International Conference on System Sciences

SN - 1530-1605

ER -