A new clustering strategy with stochastic merging and removing based on kernel functions

Huimin Geng, Hesham Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

With hierarchical clustering methods, divisions or fusions, once made, are irrevocable. As a result, when two elements in a bottom-up algorithm are assigned to one cluster, they cannot subsequently be separated. Also, when a top-down algorithm separates two elements, they can't be rejoined. Such greedy property may lead to premature convergence and consequently lead to a clustering that is far from optimal. To overcome this problem, we propose a new Stochastic Message Passing Clustering (SMPC) method based on the Message Passing Clustering (MPC) algorithm introduced in our earlier work [1]. SMPC, as a generalized version of MPC, extends the clustering algorithm from a deterministic process to a stochastic process, adding two major advantages. First, in deciding the merging cluster pair, the influences of all clusters are quantified by probabilities, estimated by kernel functions based on their relative distances. Secondly, clustering can be undone to improve the clustering performance when the algorithm detects elements which don't have good probabilities inside the cluster and moves them outside. The test results on colon cancer gene-expression data show that SMPC performs better than the deterministic MPC or hierarchical clustering method.

Original languageEnglish (US)
Title of host publication2005 IEEE Computational Systems Bioinformatics Conference, Workshops and Poster Abstracts
Pages41-42
Number of pages2
DOIs
StatePublished - 2005
Event2005 IEEE Computational Systems Bioinformatics Conference, Workshops and Poster Abstracts - Stanford, CA, United States
Duration: Aug 8 2005Aug 11 2005

Publication series

Name2005 IEEE Computational Systems Bioinformatics Conference, Workshops and Poster Abstracts

Conference

Conference2005 IEEE Computational Systems Bioinformatics Conference, Workshops and Poster Abstracts
Country/TerritoryUnited States
CityStanford, CA
Period8/8/058/11/05

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint

Dive into the research topics of 'A new clustering strategy with stochastic merging and removing based on kernel functions'. Together they form a unique fingerprint.

Cite this