Protein family classification with discriminant function analysis

Etsuko N. Moriyama, Junhyong Kim

Research output: Chapter in Book/Report/Conference proceedingChapter

11 Scopus citations

Abstract

Rapid progress in multiple genome projects continues to feed databases in the world a large volume of sequence data. In this "post-genomic" era, more efficient and reliable sequence annotation, especially functional annotation of protein sequences, is crucial. Although experimental confirmation is ultimately required, computational annotation of protein sequences has been routinely done, and it is incorporated into major protein databases (e.g., SWISS-PROT: http://www.expasy. org/sprot/, PIR-PSD: http://pir.georgetown.edu/ pirwww/search/textpsd.shtml). Due to a rapidly growing number of new sequences, increasingly more database entries contain only computational annotations. In this paper, we first discuss the disadvantage commonly found in various existing protein classification methods. Next we introduce a set of new methods that can classify protein family sharing very weak similarity. Finally, we describe an algorithm that combines strengths from various protein classification methods to obtain an optimum power for protein classifications.

Original languageEnglish (US)
Title of host publicationGenome Exploitation
Subtitle of host publicationData Mining the Genome
PublisherSpringer US
Pages121-132
Number of pages12
ISBN (Print)038724123X, 9780387241234
DOIs
StatePublished - 2005

ASJC Scopus subject areas

  • Materials Science(all)
  • Chemistry(all)

Fingerprint Dive into the research topics of 'Protein family classification with discriminant function analysis'. Together they form a unique fingerprint.

  • Cite this

    Moriyama, E. N., & Kim, J. (2005). Protein family classification with discriminant function analysis. In Genome Exploitation: Data Mining the Genome (pp. 121-132). Springer US. https://doi.org/10.1007/0-387-24187-6_9