Evaluating the robustness of correlation network analysis in the aging mouse hypothalamus

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Volumes of high-throughput assay data have been made publicly available. These massive repositories of biological data provide a wealth of information that can be harnessed to investigate pressing questions regarding aging and disease. However, there is a distinct imbalance between the techniques available for generating data and the development of methodology for analyzing it. Similar to the four “V’s” of big data, biological data has volume, velocity, and heterogeneity and is prone to error; as a result, methods for analyzing this “biomedical big data” have developed at a slower rate. One promising solution to this multi-dimensional issue is the network model, which has emerged as an effective tool for analysis because it can represent biological relationships en masse. Here we examine the need for standards and workflows in the use of the correlation network model, in which nodes represent genes and edges represent correlation between their expression patterns. One structure identified as biologically relevant in a correlation network, the gateway node, represents a gene that changes in co-expression between two different states. In this research, we manipulate the parameters used to identify gateway nodes within a given dataset to determine the consistency of results among network building and clustering approaches. This proof of concept is important because a growing pool of methods is used for the various steps in our network analysis workflow, which undermines robustness, consistency, and reproducibility. We compare the original gateway node analysis approach with variations in (1) network creation and (2) clustering analysis to test the consistency of structural results in the correlation network. To be able to trust these approaches, the field must address the fact that even minor changes in approach can have sweeping effects on results. The results of this study allow the authors to call for stronger studies in benchmarking and reproducibility in biomedical “big” data analyses.
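
As an illustration of the kind of sensitivity the abstract describes, the following is a minimal sketch, not the authors' workflow. It assumes a correlation network is built by thresholding the absolute Pearson correlation between gene expression profiles, and it compares the edge sets obtained at two thresholds with Jaccard similarity. The thresholds, the synthetic data, the Jaccard measure, and the function names `correlation_edges` and `jaccard` are all assumptions made for this example.

```python
import numpy as np


def correlation_edges(expr, threshold):
    """Return the set of gene-index pairs whose |Pearson r| >= threshold.

    expr: 2-D array with one row per gene and one column per sample.
    """
    corr = np.corrcoef(expr)                 # gene-by-gene Pearson correlation
    n = corr.shape[0]
    rows, cols = np.triu_indices(n, k=1)     # each unordered pair once, no self-loops
    keep = np.abs(corr[rows, cols]) >= threshold
    return {(int(i), int(j)) for i, j in zip(rows[keep], cols[keep])}


def jaccard(a, b):
    """Jaccard similarity of two edge sets (defined as 1.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Synthetic expression matrix (hypothetical data): 6 genes x 20 samples.
rng = np.random.default_rng(0)
base = rng.normal(size=20)
expr = np.vstack([
    base + 0.1 * rng.normal(size=20),        # genes 0-2 follow a shared signal
    base + 0.1 * rng.normal(size=20),
    base + 0.5 * rng.normal(size=20),
    rng.normal(size=(3, 20)),                # genes 3-5 are independent noise
])

# Two illustrative (assumed) thresholds; the stricter network is a subset of the looser one.
strict = correlation_edges(expr, threshold=0.9)
loose = correlation_edges(expr, threshold=0.7)
print(len(strict), len(loose), round(jaccard(strict, loose), 2))
```

A Jaccard value well below 1 would indicate that the choice of threshold alone changes the network structure, before any clustering step is applied.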

Original language: English (US)
Title of host publication: Biomedical Engineering Systems and Technologies - 8th International Joint Conference, BIOSTEC 2015, Revised Selected Papers
Editors: Dirk Elias, Ana Fred, Hugo Gamboa
Publisher: Springer Verlag
Pages: 224-238
Number of pages: 15
ISBN (Print): 9783319277066
DOI: https://doi.org/10.1007/978-3-319-27707-3_14
State: Published - Jan 1 2015
Event: 8th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2015 - Lisbon, Portugal
Duration: Jan 12 2015 to Jan 15 2015

Publication series

Name: Communications in Computer and Information Science
Volume: 574
ISSN (Print): 1865-0929

Other

Other: 8th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2015
Country: Portugal
City: Lisbon
Period: 1/12/15 to 1/15/15

Fingerprint

Electric network analysis
Aging of materials
Genes
Benchmarking
Assays
Throughput
Big data

Keywords

  • Aging
  • Correlation networks
  • Gateway nodes
  • Robustness
  • SPICi

ASJC Scopus subject areas

  • Computer Science(all)
  • Mathematics(all)

Cite this

Cooper, K. M., Bonasera, S. J., & Ali, H. H. (2015). Evaluating the robustness of correlation network analysis in the aging mouse hypothalamus. In D. Elias, A. Fred, & H. Gamboa (Eds.), Biomedical Engineering Systems and Technologies - 8th International Joint Conference, BIOSTEC 2015, Revised Selected Papers (pp. 224-238). (Communications in Computer and Information Science; Vol. 574). Springer Verlag. https://doi.org/10.1007/978-3-319-27707-3_14

@inproceedings{0a587788f8c7498caf326491d2a12e4a,
title = "Evaluating the robustness of correlation network analysis in the aging mouse hypothalamus",
keywords = "Aging, Correlation networks, Gateway nodes, Robustness, SPICi",
author = "Cooper, {Kathryn M} and Bonasera, {Stephen J} and Ali, {Hesham H}",
year = "2015",
month = "1",
day = "1",
doi = "10.1007/978-3-319-27707-3_14",
language = "English (US)",
isbn = "9783319277066",
series = "Communications in Computer and Information Science",
publisher = "Springer Verlag",
pages = "224--238",
editor = "Dirk Elias and Ana Fred and Hugo Gamboa",
booktitle = "Biomedical Engineering Systems and Technologies - 8th International Joint Conference, BIOSTEC 2015, Revised Selected Papers",

}

TY - GEN

T1 - Evaluating the robustness of correlation network analysis in the aging mouse hypothalamus

AU - Cooper, Kathryn M

AU - Bonasera, Stephen J

AU - Ali, Hesham H

PY - 2015/1/1

Y1 - 2015/1/1

KW - Aging

KW - Correlation networks

KW - Gateway nodes

KW - Robustness

KW - SPICi

UR - http://www.scopus.com/inward/record.url?scp=84955287582&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84955287582&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-27707-3_14

DO - 10.1007/978-3-319-27707-3_14

M3 - Conference contribution

AN - SCOPUS:84955287582

SN - 9783319277066

T3 - Communications in Computer and Information Science

SP - 224

EP - 238

BT - Biomedical Engineering Systems and Technologies - 8th International Joint Conference, BIOSTEC 2015, Revised Selected Papers

A2 - Elias, Dirk

A2 - Fred, Ana

A2 - Gamboa, Hugo

PB - Springer Verlag

ER -