ngLOC: An n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes

Brian R. King, Chittibabu Guda

Research output: Contribution to journalArticle

63 Scopus citations

Abstract

We present a method called ngLOC, an n-gram-based Bayesian classifier that predicts the localization of a protein sequence over ten distinct subcellular organelles. A tenfold cross-validation result shows an accuracy of 89% for sequences localized to a single organelle, and 82% for those localized to multiple organelles. An enhanced version of ngLOC was developed to estimate the subcellular proteomes of eight eukaryotic organisms: yeast, nematode, fruitfly, mosquito, zebrafish, chicken, mouse, and human.

Original languageEnglish (US)
Article numberR68
JournalGenome biology
Volume8
Issue number5
DOIs
StatePublished - May 1 2007

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Genetics
  • Cell Biology

Fingerprint Dive into the research topics of 'ngLOC: An n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes'. Together they form a unique fingerprint.

  • Cite this