On clustering biological data using unsupervised and semi-supervised message passing

Huimin Geng, Xutao Deng, Dhundy Bastola, Hesham Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Noticing that unsupervised clustering may produce clusters that are irrelevant to the research hypotheses and interests, we generalize traditional unsupervised clustering into semi-supervised clustering based on our previously proposed Message Passing Clustering (MPC). In the semi-supervised MPC, prior knowledge such as instance-level and attribute-level constraints are used to guide the clustering process towards better and interpretable partitions. We applied the unsupervised MPC (null background) to phylogenetic analysis of Mycobacterium and the semi-supervised MPC to colon cancer microarray data analysis. The results show that MPC is superior to the widely accepted neighbor-joining and hierarchical clustering methods, and the semi-supervised MPC is even more powerful in biological data analysis such as gene selection and cancer diagnosis using microarray.

Original languageEnglish (US)
Title of host publicationProceedings - BIBE 2005
Subtitle of host publication5th IEEE Symposium on Bioinformatics and Bioengineering
Pages294-298
Number of pages5
DOIs
StatePublished - 2005
EventBIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering - Minneapolis, MN, United States
Duration: Oct 19 2005Oct 21 2005

Publication series

NameProceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering
Volume2005

Conference

ConferenceBIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering
Country/TerritoryUnited States
CityMinneapolis, MN
Period10/19/0510/21/05

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint

Dive into the research topics of 'On clustering biological data using unsupervised and semi-supervised message passing'. Together they form a unique fingerprint.

Cite this