Significance analysis of clustering high throughput biological data

Hasan H. Otu, Shakirahmed Koli, Jon Jones, Osman, Towia A. Libermann

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In the post-genomic era, the availability of complete genome sequences has given rise to high throughput systems such as gene chips and protein arrays. These techniques revolutionize our understanding of biology by simultaneously probing thousands of biological entities at any given time. Unsupervised classification and clustering have emerged as important methods of analysis, which can be used to group samples with a similar molecular profile and/or molecules with a similar expression profile. However, techniques like hierarchical clustering, k-means, and self organizing maps (SOM) have been extensively used with little attention to the significance of their results. We propose a general method utilizing bootstrap technique to assign confidence levels to clustering results of high throughput biological data. We apply the proposed method to real genomics and proteomics data regarding Renal Cell Cancer (RCC), which is the most common malignancy of the adult kidney. We utilize protein profiles from IL-2 treatment responders and non-responders among metastatic RCC patients using surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI TOF-MS). We also use gene expression data using Affymetrix HG-U133A chips for primary RCC tumors, inquiring the Union International Contre le Cancer's (UICC) TNM classification.

Original languageEnglish (US)
Title of host publication2005 IEEE International Conference on Electro Information Technology
Publication statusPublished - Dec 1 2005
Event2005 IEEE International Conference on Electro Information Technology - Lincoln, NE, United States
Duration: May 22 2005May 25 2005

Publication series

Name2005 IEEE International Conference on Electro Information Technology
Volume2005

Conference

Conference2005 IEEE International Conference on Electro Information Technology
CountryUnited States
CityLincoln, NE
Period5/22/055/25/05

    Fingerprint

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Otu, H. H., Koli, S., Jones, J., Osman, & Libermann, T. A. (2005). Significance analysis of clustering high throughput biological data. In 2005 IEEE International Conference on Electro Information Technology [1627001] (2005 IEEE International Conference on Electro Information Technology; Vol. 2005).