Ontology specific data mining based on dynamic grammars

Daniel Quest, Hesham H Ali

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

3 Citations (Scopus)

Abstract

In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar-based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular expressions, which allow advanced queries for comparing sequences and searching for motifs or sequence attributes in biological databases. The formal grammar and the corresponding data mining engine are capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on a similarity scheme designed by the user. The model is based on the objective (ontology) of the user, and scoring is dynamic: the scoring scheme is provided at runtime.
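
The extract, filter, and sort pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' grammar or engine: the `mine` function, the `(identifier, sequence)` record layout, the toy sequences, and the `tata_count` scoring function are all hypothetical, chosen only to show a regular-expression filter followed by a sort whose scoring function is supplied at runtime.

```python
import re
from typing import Callable, Iterable, List, Tuple

# Hypothetical record layout (identifier, sequence); not taken from the paper.
Record = Tuple[str, str]


def mine(records: Iterable[Record],
         pattern: str,
         score: Callable[[str], float]) -> List[Record]:
    """Keep records whose sequence matches `pattern`, sorted by `score`, highest first."""
    regex = re.compile(pattern)
    # Filter step: a regular expression selects the subset to mine.
    selected = [rec for rec in records if regex.search(rec[1])]
    # Sort step: the scoring function is passed in by the caller at runtime.
    return sorted(selected, key=lambda rec: score(rec[1]), reverse=True)


# Toy data invented for illustration only.
RECORDS = [
    ("seq1", "ATGCGTATAAAGGC"),
    ("seq2", "GGGCCCGGGCCC"),
    ("seq3", "TATAAATATAAACG"),
]


def tata_count(seq: str) -> float:
    """Hypothetical user-defined score: number of TATAAA occurrences."""
    return float(len(re.findall("TATAAA", seq)))


ranked = mine(RECORDS, r"TATA[AT]A", tata_count)
for identifier, sequence in ranked:
    print(identifier, sequence)
```

Passing a different `pattern` or `score` changes which records are selected and how they are ranked without touching `mine` itself, which is the sense in which scoring is "dynamic" in this sketch.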

Original language: English (US)
Title of host publication: Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004
Pages: 495-496
Number of pages: 2
State: Published - Dec 1 2004
Event: Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004 - Stanford, CA, United States
Duration: Aug 16 2004 to Aug 19 2004

Publication series

Name: Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004

Conference

Conference: Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004
Country: United States
City: Stanford, CA
Period: 8/16/04 to 8/19/04

Fingerprint

Data mining
Ontology
Sorting
Engines

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Quest, D., & Ali, H. H. (2004). Ontology specific data mining based on dynamic grammars. In Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004 (pp. 495-496). (Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004).

Ontology specific data mining based on dynamic grammars. / Quest, Daniel; Ali, Hesham H.

Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004. 2004. p. 495-496 (Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004).

Quest, D & Ali, HH 2004, Ontology specific data mining based on dynamic grammars. in Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004. Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004, pp. 495-496, Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004, Stanford, CA, United States, 8/16/04.
Quest D, Ali HH. Ontology specific data mining based on dynamic grammars. In Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004. 2004. p. 495-496. (Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004).
Quest, Daniel ; Ali, Hesham H. / Ontology specific data mining based on dynamic grammars. Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004. 2004. pp. 495-496 (Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004).
@inproceedings{9f15f2b282914894bec7b81c3f8f2a07,
title = "Ontology specific data mining based on dynamic grammars",
abstract = "In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar-based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular expressions, which allow advanced queries for comparing sequences and searching for motifs or sequence attributes in biological databases. The formal grammar and the corresponding data mining engine are capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on a similarity scheme designed by the user. The model is based on the objective (ontology) of the user, and scoring is dynamic: the scoring scheme is provided at runtime.",
author = "Daniel Quest and Ali, {Hesham H}",
year = "2004",
month = "12",
day = "1",
language = "English (US)",
isbn = "0769521940",
series = "Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004",
pages = "495--496",
booktitle = "Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004",

}

TY - GEN

T1 - Ontology specific data mining based on dynamic grammars

AU - Quest, Daniel

AU - Ali, Hesham H

PY - 2004/12/1

Y1 - 2004/12/1

N2 - In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar-based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular expressions, which allow advanced queries for comparing sequences and searching for motifs or sequence attributes in biological databases. The formal grammar and the corresponding data mining engine are capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on a similarity scheme designed by the user. The model is based on the objective (ontology) of the user, and scoring is dynamic: the scoring scheme is provided at runtime.

AB - In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar-based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular expressions, which allow advanced queries for comparing sequences and searching for motifs or sequence attributes in biological databases. The formal grammar and the corresponding data mining engine are capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on a similarity scheme designed by the user. The model is based on the objective (ontology) of the user, and scoring is dynamic: the scoring scheme is provided at runtime.

UR - http://www.scopus.com/inward/record.url?scp=14044262932&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=14044262932&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:14044262932

SN - 0769521940

SN - 9780769521947

T3 - Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004

SP - 495

EP - 496

BT - Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004

ER -