Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Since the explosive influx of biological data obtained from high-throughput medical instruments, the ability to leverage the currently available data to extract useful knowledge has become one of the most challenging problems in biomedical research. The analysis of such data is particularly complex not only due to its massive size but also due to its heterogeneity and inherent noise associated with several data gathering steps. The utilization of biological networks to model and integrate large-scale heterogeneous biomedical data continues to grow, especially with the systems biology approach taking center stage in many bioinformatics applications. Although loaded with biologically relevant signals, correlation networks do contain noise and are too large for simple data mining tools. In this project, we implement different types of filters to reduce the network size and sort out signals from noise. We propose a new approach for generating various filters that iterate on sub graphs along a spectrum between spanning tree and chordal filters. We show how different network filters incrementally obtain various clusters along this spectrum to maintain structural and domain-relevant components of the original network, while reducing noise. We test the proposed approach using gene expression levels obtained from diabetes and yeast datasets and compare the filtered networks with original networks using ontology enrichment. The obtained results support our main hypothesis that the filters conserve important elements from the original networks while uncovering new biologically significant clusters. However, results analyzing maintained and uncovered biologically significant hubs were inconclusive.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013
PublisherIEEE Computer Society
Pages584-591
Number of pages8
DOIs
StatePublished - 2013
Event2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013 - Dallas, TX
Duration: Dec 7 2013Dec 10 2013

Other

Other2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013
CityDallas, TX
Period12/7/1312/10/13

Fingerprint

Bioinformatics
Medical problems
Gene expression
Yeast
Data mining
Ontology
Throughput
Systems Biology

Keywords

  • Clusters
  • Correlation networks
  • Gene expressions
  • Hubs
  • Ontology enrichment
  • Systems biology

ASJC Scopus subject areas

  • Software

Cite this

West, S., Cooper, K. M., Bhowmick, S., & Ali, H. H. (2013). Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment. In Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013 (pp. 584-591). [6753973] IEEE Computer Society. https://doi.org/10.1109/ICDMW.2013.147

Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment. / West, Sean; Cooper, Kathryn M; Bhowmick, Sanjukta; Ali, Hesham H.

Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013. IEEE Computer Society, 2013. p. 584-591 6753973.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

West, S, Cooper, KM, Bhowmick, S & Ali, HH 2013, Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment. in Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013., 6753973, IEEE Computer Society, pp. 584-591, 2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013, Dallas, TX, 12/7/13. https://doi.org/10.1109/ICDMW.2013.147
West S, Cooper KM, Bhowmick S, Ali HH. Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment. In Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013. IEEE Computer Society. 2013. p. 584-591. 6753973 https://doi.org/10.1109/ICDMW.2013.147
West, Sean ; Cooper, Kathryn M ; Bhowmick, Sanjukta ; Ali, Hesham H. / Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment. Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013. IEEE Computer Society, 2013. pp. 584-591
@inproceedings{34311f8ab6c04e9da420b72e352343f3,
title = "Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment",
abstract = "Since the explosive influx of biological data obtained from high-throughput medical instruments, the ability to leverage the currently available data to extract useful knowledge has become one of the most challenging problems in biomedical research. The analysis of such data is particularly complex not only due to its massive size but also due to its heterogeneity and inherent noise associated with several data gathering steps. The utilization of biological networks to model and integrate large-scale heterogeneous biomedical data continues to grow, especially with the systems biology approach taking center stage in many bioinformatics applications. Although loaded with biologically relevant signals, correlation networks do contain noise and are too large for simple data mining tools. In this project, we implement different types of filters to reduce the network size and sort out signals from noise. We propose a new approach for generating various filters that iterate on sub graphs along a spectrum between spanning tree and chordal filters. We show how different network filters incrementally obtain various clusters along this spectrum to maintain structural and domain-relevant components of the original network, while reducing noise. We test the proposed approach using gene expression levels obtained from diabetes and yeast datasets and compare the filtered networks with original networks using ontology enrichment. The obtained results support our main hypothesis that the filters conserve important elements from the original networks while uncovering new biologically significant clusters. However, results analyzing maintained and uncovered biologically significant hubs were inconclusive.",
keywords = "Clusters, Correlation networks, Gene expressions, Hubs, Ontology enrichment, Systems biology",
author = "Sean West and Cooper, {Kathryn M} and Sanjukta Bhowmick and Ali, {Hesham H}",
year = "2013",
doi = "10.1109/ICDMW.2013.147",
language = "English (US)",
pages = "584--591",
booktitle = "Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013",
publisher = "IEEE Computer Society",

}

TY - GEN

T1 - Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment

AU - West, Sean

AU - Cooper, Kathryn M

AU - Bhowmick, Sanjukta

AU - Ali, Hesham H

PY - 2013

Y1 - 2013

N2 - Since the explosive influx of biological data obtained from high-throughput medical instruments, the ability to leverage the currently available data to extract useful knowledge has become one of the most challenging problems in biomedical research. The analysis of such data is particularly complex not only due to its massive size but also due to its heterogeneity and inherent noise associated with several data gathering steps. The utilization of biological networks to model and integrate large-scale heterogeneous biomedical data continues to grow, especially with the systems biology approach taking center stage in many bioinformatics applications. Although loaded with biologically relevant signals, correlation networks do contain noise and are too large for simple data mining tools. In this project, we implement different types of filters to reduce the network size and sort out signals from noise. We propose a new approach for generating various filters that iterate on sub graphs along a spectrum between spanning tree and chordal filters. We show how different network filters incrementally obtain various clusters along this spectrum to maintain structural and domain-relevant components of the original network, while reducing noise. We test the proposed approach using gene expression levels obtained from diabetes and yeast datasets and compare the filtered networks with original networks using ontology enrichment. The obtained results support our main hypothesis that the filters conserve important elements from the original networks while uncovering new biologically significant clusters. However, results analyzing maintained and uncovered biologically significant hubs were inconclusive.

AB - Since the explosive influx of biological data obtained from high-throughput medical instruments, the ability to leverage the currently available data to extract useful knowledge has become one of the most challenging problems in biomedical research. The analysis of such data is particularly complex not only due to its massive size but also due to its heterogeneity and inherent noise associated with several data gathering steps. The utilization of biological networks to model and integrate large-scale heterogeneous biomedical data continues to grow, especially with the systems biology approach taking center stage in many bioinformatics applications. Although loaded with biologically relevant signals, correlation networks do contain noise and are too large for simple data mining tools. In this project, we implement different types of filters to reduce the network size and sort out signals from noise. We propose a new approach for generating various filters that iterate on sub graphs along a spectrum between spanning tree and chordal filters. We show how different network filters incrementally obtain various clusters along this spectrum to maintain structural and domain-relevant components of the original network, while reducing noise. We test the proposed approach using gene expression levels obtained from diabetes and yeast datasets and compare the filtered networks with original networks using ontology enrichment. The obtained results support our main hypothesis that the filters conserve important elements from the original networks while uncovering new biologically significant clusters. However, results analyzing maintained and uncovered biologically significant hubs were inconclusive.

KW - Clusters

KW - Correlation networks

KW - Gene expressions

KW - Hubs

KW - Ontology enrichment

KW - Systems biology

UR - http://www.scopus.com/inward/record.url?scp=84898037983&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84898037983&partnerID=8YFLogxK

U2 - 10.1109/ICDMW.2013.147

DO - 10.1109/ICDMW.2013.147

M3 - Conference contribution

SP - 584

EP - 591

BT - Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013

PB - IEEE Computer Society

ER -