On the impact of granularity in extracting knowledge from bioinformatics data

Sean West, Hesham H Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

With the rapidly increasing amount of various types of biological data currently available to researchers, the focus of the biomedical research community has been shifting from pure data generation towards the development of new methodologies for data analytics. Although many researchers continue to focus on approaches developed for analyzing single types of biological data, recent attempts have been made to utilize the availability of heterogeneous data sets that contain various types of data and try to establish tools for data integration and analysis in many bioinformatics applications. Such attempts are expected to increase significantly in this coming decade. While this can be viewed as a positive step towards advancing big data analytics in bioinformatics, it is critical that these integration methodologies are meticulously studied to ensure high quality of the knowledge extracted from the integrated data. In this work, we employ data integration methods to analyze biological data obtained from protein interaction networks and gene expression data. We conduct a study to show that potential problems can arise from integrating or fusing data obtained at different granularity levels and highlight the importance of developing advanced data fusing techniques to integrate various types of biological data for analytical purposes. Further, we explore the impact of granularity from a more formulized approach and the granularity levels significantly impact the quality of knowledge extracted from the integrated data.

Original languageEnglish (US)
Title of host publicationBIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016
PublisherSciTePress
Pages92-103
Number of pages12
ISBN (Print)9789897581700
StatePublished - 2016
Event7th International Conference on Bioinformatics Models, Methods and Algorithms, BIOINFORMATICS 2016 - Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016 - Rome, Italy
Duration: Feb 21 2016Feb 23 2016

Other

Other7th International Conference on Bioinformatics Models, Methods and Algorithms, BIOINFORMATICS 2016 - Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016
CountryItaly
CityRome
Period2/21/162/23/16

Fingerprint

Data integration
Bioinformatics
Gene expression
Availability
Proteins
Big data

Keywords

  • And clusters
  • Co-regulation
  • Correlation networks
  • Data integration
  • Gene expression data
  • Knowledge extraction
  • Protein-protein interaction

ASJC Scopus subject areas

  • Biomedical Engineering
  • Electrical and Electronic Engineering

Cite this

West, S., & Ali, H. H. (2016). On the impact of granularity in extracting knowledge from bioinformatics data. In BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016 (pp. 92-103). SciTePress.

On the impact of granularity in extracting knowledge from bioinformatics data. / West, Sean; Ali, Hesham H.

BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016. SciTePress, 2016. p. 92-103.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

West, S & Ali, HH 2016, On the impact of granularity in extracting knowledge from bioinformatics data. in BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016. SciTePress, pp. 92-103, 7th International Conference on Bioinformatics Models, Methods and Algorithms, BIOINFORMATICS 2016 - Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016, Rome, Italy, 2/21/16.
West S, Ali HH. On the impact of granularity in extracting knowledge from bioinformatics data. In BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016. SciTePress. 2016. p. 92-103
West, Sean ; Ali, Hesham H. / On the impact of granularity in extracting knowledge from bioinformatics data. BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016. SciTePress, 2016. pp. 92-103
@inproceedings{dc1c135551f7451b8cb6a4648065a441,
title = "On the impact of granularity in extracting knowledge from bioinformatics data",
abstract = "With the rapidly increasing amount of various types of biological data currently available to researchers, the focus of the biomedical research community has been shifting from pure data generation towards the development of new methodologies for data analytics. Although many researchers continue to focus on approaches developed for analyzing single types of biological data, recent attempts have been made to utilize the availability of heterogeneous data sets that contain various types of data and try to establish tools for data integration and analysis in many bioinformatics applications. Such attempts are expected to increase significantly in this coming decade. While this can be viewed as a positive step towards advancing big data analytics in bioinformatics, it is critical that these integration methodologies are meticulously studied to ensure high quality of the knowledge extracted from the integrated data. In this work, we employ data integration methods to analyze biological data obtained from protein interaction networks and gene expression data. We conduct a study to show that potential problems can arise from integrating or fusing data obtained at different granularity levels and highlight the importance of developing advanced data fusing techniques to integrate various types of biological data for analytical purposes. Further, we explore the impact of granularity from a more formulized approach and the granularity levels significantly impact the quality of knowledge extracted from the integrated data.",
keywords = "And clusters, Co-regulation, Correlation networks, Data integration, Gene expression data, Knowledge extraction, Protein-protein interaction",
author = "Sean West and Ali, {Hesham H}",
year = "2016",
language = "English (US)",
isbn = "9789897581700",
pages = "92--103",
booktitle = "BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016",
publisher = "SciTePress",

}

TY - GEN

T1 - On the impact of granularity in extracting knowledge from bioinformatics data

AU - West, Sean

AU - Ali, Hesham H

PY - 2016

Y1 - 2016

N2 - With the rapidly increasing amount of various types of biological data currently available to researchers, the focus of the biomedical research community has been shifting from pure data generation towards the development of new methodologies for data analytics. Although many researchers continue to focus on approaches developed for analyzing single types of biological data, recent attempts have been made to utilize the availability of heterogeneous data sets that contain various types of data and try to establish tools for data integration and analysis in many bioinformatics applications. Such attempts are expected to increase significantly in this coming decade. While this can be viewed as a positive step towards advancing big data analytics in bioinformatics, it is critical that these integration methodologies are meticulously studied to ensure high quality of the knowledge extracted from the integrated data. In this work, we employ data integration methods to analyze biological data obtained from protein interaction networks and gene expression data. We conduct a study to show that potential problems can arise from integrating or fusing data obtained at different granularity levels and highlight the importance of developing advanced data fusing techniques to integrate various types of biological data for analytical purposes. Further, we explore the impact of granularity from a more formulized approach and the granularity levels significantly impact the quality of knowledge extracted from the integrated data.

AB - With the rapidly increasing amount of various types of biological data currently available to researchers, the focus of the biomedical research community has been shifting from pure data generation towards the development of new methodologies for data analytics. Although many researchers continue to focus on approaches developed for analyzing single types of biological data, recent attempts have been made to utilize the availability of heterogeneous data sets that contain various types of data and try to establish tools for data integration and analysis in many bioinformatics applications. Such attempts are expected to increase significantly in this coming decade. While this can be viewed as a positive step towards advancing big data analytics in bioinformatics, it is critical that these integration methodologies are meticulously studied to ensure high quality of the knowledge extracted from the integrated data. In this work, we employ data integration methods to analyze biological data obtained from protein interaction networks and gene expression data. We conduct a study to show that potential problems can arise from integrating or fusing data obtained at different granularity levels and highlight the importance of developing advanced data fusing techniques to integrate various types of biological data for analytical purposes. Further, we explore the impact of granularity from a more formulized approach and the granularity levels significantly impact the quality of knowledge extracted from the integrated data.

KW - And clusters

KW - Co-regulation

KW - Correlation networks

KW - Data integration

KW - Gene expression data

KW - Knowledge extraction

KW - Protein-protein interaction

UR - http://www.scopus.com/inward/record.url?scp=84969219431&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84969219431&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9789897581700

SP - 92

EP - 103

BT - BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 9th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2016

PB - SciTePress

ER -