Predicting yeast gene function based on hidden markov models

Xutao Deng, Huimin Geng, Hesham Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

The prediction of function classes for unannotated genes or Open Reading Frames (ORFs) is important for understanding the function role of genes and gene networks. Existing data mining tools, such as Support Vector Machines (SVMs) and K-Nearest Neighbors (KNNs), can only achieve about 40% precision. We developed a gene function prediction tool based on profile Hidden Markov Models (HMMs). HMMs have shown great successes in modeling noisy sequential data sets in speech recognition and protein sequence profiling. Results from contingency test showed significant Markov dependency in time-series expression data, and therefore HMMs would be especially appropriate for modeling gene expressions. Each function class is associated with a distinct HMM whose parameters are trained using yeast time-series gene expression data. The function annotations of the HMM training set were obtained from the Munich Information Centre for Protein Sequences (MIPS) data base. We designed two structural variants of HMMs (chain HMM, split HMM) and tested each of them on 40 function classes. The highest overall prediction precision achieved was 67% using double-split HMM with n-fold cross-validation. We also attempted to generalize HMMs to Dynamic Bayesian Networks (DBNs) for gene function prediction using heterogeneous data sets.

Original languageEnglish (US)
Title of host publication20th International Conference on Computers and Their Applications 2005, CATA 2005
Pages196-201
Number of pages6
StatePublished - 2005
Event20th International Conference on Computers and Their Applications 2005, CATA 2005 - New Orleans, LA, United States
Duration: Mar 16 2005Mar 18 2005

Publication series

Name20th International Conference on Computers and Their Applications 2005, CATA 2005

Conference

Conference20th International Conference on Computers and Their Applications 2005, CATA 2005
Country/TerritoryUnited States
CityNew Orleans, LA
Period3/16/053/18/05

Keywords

  • Function prediction
  • Gene expression
  • Hidden markov model

ASJC Scopus subject areas

  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Predicting yeast gene function based on hidden markov models'. Together they form a unique fingerprint.

Cite this