TY - JOUR
T1 - A method of precise mRNA/DNA homology-based gene structure prediction
AU - Churbanov, Alexander
AU - Pauley, Mark
AU - Quest, Daniel
AU - Ali, Hesham
PY - 2005/10/21
Y1 - 2005/10/21
N2 - Background: Accurate and automatic gene finding and structural prediction is a common problem in bioinformatics, and applications need to be capable of handling non-canonical splice sites, micro-exons and partial gene structure predictions that span across several genomic clones. Results: We present a mRNA/DNA homology based gene structure prediction tool, GIGOgene. We use a new affine gap penalty splice-enhanced global alignment algorithm running in linear memory for a high quality annotation of splice sites. Our tool includes a novel algorithm to assemble partial gene structure predictions using interval graphs. GIGOgene exhibited a sensitivity of 99.08% and a specificity of 99.98% on the Genie learning set, and demonstrated a higher quality of gene structural prediction when compared to Sim4, est2genome, Spidey, Galahad and BLAT, including when genes contained micro-exons and non-canonical splice sites. GIGOgene showed an acceptable loss of prediction quality when confronted with a noisy Genie learning set simulating ESTs. Conclusions: GIGOgene shows a higher quality of gene structure prediction for mRNA/DNA spliced alignment when compared to other available tools.
AB - Background: Accurate and automatic gene finding and structural prediction is a common problem in bioinformatics, and applications need to be capable of handling non-canonical splice sites, micro-exons and partial gene structure predictions that span across several genomic clones. Results: We present a mRNA/DNA homology based gene structure prediction tool, GIGOgene. We use a new affine gap penalty splice-enhanced global alignment algorithm running in linear memory for a high quality annotation of splice sites. Our tool includes a novel algorithm to assemble partial gene structure predictions using interval graphs. GIGOgene exhibited a sensitivity of 99.08% and a specificity of 99.98% on the Genie learning set, and demonstrated a higher quality of gene structural prediction when compared to Sim4, est2genome, Spidey, Galahad and BLAT, including when genes contained micro-exons and non-canonical splice sites. GIGOgene showed an acceptable loss of prediction quality when confronted with a noisy Genie learning set simulating ESTs. Conclusions: GIGOgene shows a higher quality of gene structure prediction for mRNA/DNA spliced alignment when compared to other available tools.
UR - http://www.scopus.com/inward/record.url?scp=27744471022&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=27744471022&partnerID=8YFLogxK
U2 - 10.1186/1471-2105-6-261
DO - 10.1186/1471-2105-6-261
M3 - Article
C2 - 16242044
AN - SCOPUS:27744471022
SN - 1471-2105
VL - 6
JO - BMC bioinformatics
JF - BMC bioinformatics
M1 - 261
ER -