A genome signature based on Markov modeling

Jian Li, Khalid Sayood

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

We propose a "genome signature" for bacterial genomes based on a triplets Markov model. Without the alignment or data preprocessing required by traditional analysis methods, the model is shown to efficiently capture identifying genomic information at both species and strain levels. Based on the model, a simple assumption-free distance measure is proposed for constructing phytogeny trees. The approach avoids problems with word frequency approaches such as balancing word length and window size. The method is shown to work successfully with both bacterial whole genome data and individual eukaryotic genes. Application of the model to phylogenetic analysis is presented.

Original languageEnglish (US)
Title of host publication2005 IEEE International Conference on Electro Information Technology
StatePublished - 2005
Event2005 IEEE International Conference on Electro Information Technology - Lincoln, NE, United States
Duration: May 22 2005May 25 2005

Publication series

Name2005 IEEE International Conference on Electro Information Technology
Volume2005

Conference

Conference2005 IEEE International Conference on Electro Information Technology
Country/TerritoryUnited States
CityLincoln, NE
Period5/22/055/25/05

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'A genome signature based on Markov modeling'. Together they form a unique fingerprint.

Cite this