On clustering biological data using unsupervised and semi-supervised message passing

Huimin Geng, Xutao Deng, Dhundy Raj Bastola, Hesham H Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Noticing that unsupervised clustering may produce clusters that are irrelevant to the research hypotheses and interests, we generalize traditional unsupervised clustering into semi-supervised clustering based on our previously proposed Message Passing Clustering (MPC). In the semi-supervised MPC, prior knowledge such as instance-level and attribute-level constraints are used to guide the clustering process towards better and interpretable partitions. We applied the unsupervised MPC (null background) to phylogenetic analysis of Mycobacterium and the semi-supervised MPC to colon cancer microarray data analysis. The results show that MPC is superior to the widely accepted neighbor-joining and hierarchical clustering methods, and the semi-supervised MPC is even more powerful in biological data analysis such as gene selection and cancer diagnosis using microarray.

Original languageEnglish (US)
Title of host publicationProceedings - BIBE 2005
Subtitle of host publication5th IEEE Symposium on Bioinformatics and Bioengineering
Pages294-298
Number of pages5
DOIs
StatePublished - Dec 1 2005
EventBIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering - Minneapolis, MN, United States
Duration: Oct 19 2005Oct 21 2005

Publication series

NameProceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering
Volume2005

Conference

ConferenceBIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering
CountryUnited States
CityMinneapolis, MN
Period10/19/0510/21/05

Fingerprint

Message passing
Microarrays
Joining
Genes

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Geng, H., Deng, X., Bastola, D. R., & Ali, H. H. (2005). On clustering biological data using unsupervised and semi-supervised message passing. In Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering (pp. 294-298). [1544484] (Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering; Vol. 2005). https://doi.org/10.1109/BIBE.2005.44

On clustering biological data using unsupervised and semi-supervised message passing. / Geng, Huimin; Deng, Xutao; Bastola, Dhundy Raj; Ali, Hesham H.

Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering. 2005. p. 294-298 1544484 (Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering; Vol. 2005).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Geng, H, Deng, X, Bastola, DR & Ali, HH 2005, On clustering biological data using unsupervised and semi-supervised message passing. in Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering., 1544484, Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering, vol. 2005, pp. 294-298, BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering, Minneapolis, MN, United States, 10/19/05. https://doi.org/10.1109/BIBE.2005.44
Geng H, Deng X, Bastola DR, Ali HH. On clustering biological data using unsupervised and semi-supervised message passing. In Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering. 2005. p. 294-298. 1544484. (Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering). https://doi.org/10.1109/BIBE.2005.44
Geng, Huimin ; Deng, Xutao ; Bastola, Dhundy Raj ; Ali, Hesham H. / On clustering biological data using unsupervised and semi-supervised message passing. Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering. 2005. pp. 294-298 (Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering).
@inproceedings{31fc42699b6643dea37480ec416a51c7,
title = "On clustering biological data using unsupervised and semi-supervised message passing",
abstract = "Noticing that unsupervised clustering may produce clusters that are irrelevant to the research hypotheses and interests, we generalize traditional unsupervised clustering into semi-supervised clustering based on our previously proposed Message Passing Clustering (MPC). In the semi-supervised MPC, prior knowledge such as instance-level and attribute-level constraints are used to guide the clustering process towards better and interpretable partitions. We applied the unsupervised MPC (null background) to phylogenetic analysis of Mycobacterium and the semi-supervised MPC to colon cancer microarray data analysis. The results show that MPC is superior to the widely accepted neighbor-joining and hierarchical clustering methods, and the semi-supervised MPC is even more powerful in biological data analysis such as gene selection and cancer diagnosis using microarray.",
author = "Huimin Geng and Xutao Deng and Bastola, {Dhundy Raj} and Ali, {Hesham H}",
year = "2005",
month = "12",
day = "1",
doi = "10.1109/BIBE.2005.44",
language = "English (US)",
isbn = "0769524761",
series = "Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering",
pages = "294--298",
booktitle = "Proceedings - BIBE 2005",

}

TY - GEN

T1 - On clustering biological data using unsupervised and semi-supervised message passing

AU - Geng, Huimin

AU - Deng, Xutao

AU - Bastola, Dhundy Raj

AU - Ali, Hesham H

PY - 2005/12/1

Y1 - 2005/12/1

N2 - Noticing that unsupervised clustering may produce clusters that are irrelevant to the research hypotheses and interests, we generalize traditional unsupervised clustering into semi-supervised clustering based on our previously proposed Message Passing Clustering (MPC). In the semi-supervised MPC, prior knowledge such as instance-level and attribute-level constraints are used to guide the clustering process towards better and interpretable partitions. We applied the unsupervised MPC (null background) to phylogenetic analysis of Mycobacterium and the semi-supervised MPC to colon cancer microarray data analysis. The results show that MPC is superior to the widely accepted neighbor-joining and hierarchical clustering methods, and the semi-supervised MPC is even more powerful in biological data analysis such as gene selection and cancer diagnosis using microarray.

AB - Noticing that unsupervised clustering may produce clusters that are irrelevant to the research hypotheses and interests, we generalize traditional unsupervised clustering into semi-supervised clustering based on our previously proposed Message Passing Clustering (MPC). In the semi-supervised MPC, prior knowledge such as instance-level and attribute-level constraints are used to guide the clustering process towards better and interpretable partitions. We applied the unsupervised MPC (null background) to phylogenetic analysis of Mycobacterium and the semi-supervised MPC to colon cancer microarray data analysis. The results show that MPC is superior to the widely accepted neighbor-joining and hierarchical clustering methods, and the semi-supervised MPC is even more powerful in biological data analysis such as gene selection and cancer diagnosis using microarray.

UR - http://www.scopus.com/inward/record.url?scp=33751171082&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33751171082&partnerID=8YFLogxK

U2 - 10.1109/BIBE.2005.44

DO - 10.1109/BIBE.2005.44

M3 - Conference contribution

SN - 0769524761

SN - 9780769524764

T3 - Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering

SP - 294

EP - 298

BT - Proceedings - BIBE 2005

ER -