TY - GEN
T1 - Message passing clustering with stochastic merging based on kernel functions
AU - Geng, Huimin
AU - Deng, Xutao
AU - Ali, Hesham H.
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2005
Y1 - 2005
N2 - In this paper, we propose a new Stochastic Message Passing Clustering (SMPC) algorithm for clustering biological data based on the Message Passing Clustering (MPC) algorithm, which we introduced in earlier work. MPC has shown its advantage when applied to describing parallel and spontaneous biological processes. SMPC, as a generalized version of MPC, extends the clustering algorithm from a deterministic process to a stochastic process, adding three major advantages. First, in deciding the merging cluster pair, the influences of all clusters are quantified by probabilities, estimated by kernel functions based on their relative distances. Second, the proposed algorithm property resolve the "tie" problem, which often occurs for integer distances as in the case of protein interaction data. Third, clustering can be undone to improve the clustering performance when the algorithm detects objects which don't have good probabilities inside the cluster and moves them outside. The test results on colon cancer gene-expression data show that SMPC performs better than the deterministic MPC.
AB - In this paper, we propose a new Stochastic Message Passing Clustering (SMPC) algorithm for clustering biological data based on the Message Passing Clustering (MPC) algorithm, which we introduced in earlier work. MPC has shown its advantage when applied to describing parallel and spontaneous biological processes. SMPC, as a generalized version of MPC, extends the clustering algorithm from a deterministic process to a stochastic process, adding three major advantages. First, in deciding the merging cluster pair, the influences of all clusters are quantified by probabilities, estimated by kernel functions based on their relative distances. Second, the proposed algorithm property resolve the "tie" problem, which often occurs for integer distances as in the case of protein interaction data. Third, clustering can be undone to improve the clustering performance when the algorithm detects objects which don't have good probabilities inside the cluster and moves them outside. The test results on colon cancer gene-expression data show that SMPC performs better than the deterministic MPC.
KW - Clustering
KW - Hierarchical clustering
KW - Kernel functions
KW - Message passing clustering
KW - Stochastic processes
UR - http://www.scopus.com/inward/record.url?scp=84870045415&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84870045415&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84870045415
SN - 9781604235531
T3 - Association for Information Systems - 11th Americas Conference on Information Systems, AMCIS 2005: A Conference on a Human Scale
SP - 2662
EP - 2671
BT - Association for Information Systems - 11th Americas Conference on Information Systems, AMCIS 2005
T2 - 11th Americas Conference on Information Systems, AMCIS 2005
Y2 - 11 August 2005 through 15 August 2005
ER -