Differential expression analysis in RNA-Seq by a naive bayes classifier with local normalization

Yongchao Dou, Xiaomei Guo, Lingling Yuan, David R. Holding, Chi Zhang

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experimental designs with computational results. In this study, a new tool for the identification of differentially expressed genes with RNA-seq data, named GExposer, was developed. This tool introduces a local normalization algorithm to reduce the bias of nonrandomly positioned read depth. The naive Bayes classifier is employed to integrate fold change, transcript length, and GC content to identify differentially expressed genes. Results on several independent tests show that GExposer has better performance than other methods. The combination of the local normalization algorithm and naive Bayes classifier with three attributes can achieve better results; both false positive rates and false negative rates are reduced. However, only a small portion of genes is affected by the local normalization and GC content correction.

Original languageEnglish (US)
Article number789516
JournalBioMed research international
Volume2015
DOIs
StatePublished - 2015

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology

Fingerprint

Dive into the research topics of 'Differential expression analysis in RNA-Seq by a naive bayes classifier with local normalization'. Together they form a unique fingerprint.

Cite this