PANTHER: A browsable database of gene products organized by biological function, using curated protein family and subfamily classification

Paul D. Thomas, Anish Kejariwal, Michael J. Campbell, Huaiyu Mi, Karen Diemer, Nan Guo, Istvan Ladunga, Betty Ulitsky-Lazareva, Anushya Muruganujan, Steven Rabkin, Jody A. Vandergriff, Oliver Doremieux

Research output: Contribution to journal › Review article

449 Scopus citations

Abstract

The PANTHER database was designed for high-throughput analysis of protein sequences. One of its key features is a simplified ontology of protein function, which allows the database to be browsed by biological function. Biologist curators have associated the ontology terms with groups of protein sequences rather than with individual sequences. Statistical models (Hidden Markov Models, or HMMs) are built from each of these groups. The advantage of this approach is that new sequences can be automatically classified as they become available. To ensure accurate functional classification, HMMs are constructed not only for families, but also for functionally distinct subfamilies. Multiple sequence alignments and phylogenetic trees, including curator-assigned information, are available for each family. The current version of the PANTHER database includes training sequences from all organisms in the GenBank non-redundant protein database, and the HMMs have been used to classify gene products across the entire genomes of human and Drosophila melanogaster. PANTHER is publicly available on the web at http://panther.celera.com.
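The two-level classification described in the abstract (score a new sequence against family models, then against the subfamily models of the best family) can be illustrated with a minimal sketch. This is not PANTHER's implementation: it swaps the profile HMMs for a much simpler position-specific log-odds profile (match states only, no insertions or deletions), and the family names, subfamily names, and training sequences in `TOY_FAMILIES` are invented for illustration.

```python
import math
from collections import Counter

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
BACKGROUND = 1.0 / len(ALPHABET)    # assumption: uniform background frequency


def build_profile(aligned_seqs, pseudocount=1.0):
    """Build per-column log-odds scores from equal-length, gap-free sequences."""
    length = len(aligned_seqs[0])
    total = len(aligned_seqs) + pseudocount * len(ALPHABET)
    profile = []
    for col in range(length):
        counts = Counter(seq[col] for seq in aligned_seqs)
        profile.append({
            aa: math.log(((counts[aa] + pseudocount) / total) / BACKGROUND)
            for aa in ALPHABET
        })
    return profile


def score(profile, seq):
    """Sum of column log-odds; sequences of the wrong length cannot match."""
    if len(seq) != len(profile):
        return float("-inf")
    return sum(column[aa] for column, aa in zip(profile, seq))


def classify(seq, families):
    """Pick the best-scoring family, then the best subfamily inside it.

    families: {family_name: {subfamily_name: [training sequences]}}
    """
    family_profiles = {
        fam: build_profile([s for seqs in subfams.values() for s in seqs])
        for fam, subfams in families.items()
    }
    best_family = max(family_profiles,
                      key=lambda f: score(family_profiles[f], seq))
    subfamily_profiles = {
        sub: build_profile(seqs)
        for sub, seqs in families[best_family].items()
    }
    best_subfamily = max(subfamily_profiles,
                         key=lambda s: score(subfamily_profiles[s], seq))
    return best_family, best_subfamily


# Hypothetical toy data, not taken from PANTHER.
TOY_FAMILIES = {
    "FAM_A": {"SF1": ["ACDKG", "ACDRG"], "SF2": ["ACEKG", "ACEKA"]},
    "FAM_B": {"SF1": ["WWLLP", "WYLLP"]},
}

print(classify("ACDKG", TOY_FAMILIES))  # ('FAM_A', 'SF1')
```

Because the models are built per group rather than per sequence, a sequence that was never seen by a curator still inherits the group's annotation once it scores best against that group's model, which is the property the abstract highlights.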

Original language: English (US)
Pages (from-to): 334-341
Number of pages: 8
Journal: Nucleic acids research
Volume: 31
Issue number: 1
DOIs: https://doi.org/10.1093/nar/gkg115
State: Published - Jan 1 2003

ASJC Scopus subject areas

  • Genetics

  • Cite this

    Thomas, P. D., Kejariwal, A., Campbell, M. J., Mi, H., Diemer, K., Guo, N., Ladunga, I., Ulitsky-Lazareva, B., Muruganujan, A., Rabkin, S., Vandergriff, J. A., & Doremieux, O. (2003). PANTHER: A browsable database of gene products organized by biological function, using curated protein family and subfamily classification. Nucleic acids research, 31(1), 334-341. https://doi.org/10.1093/nar/gkg115