TY - GEN
T1 - Evaluating assembly pipeline for transcriptomes
AU - Pavlovikj, Natasha
AU - Begcy, Kevin
AU - Behera, Sairam
AU - Campbell, Malachy
AU - Walia, Harkamal
AU - Deogun, Jitender S.
PY - 2014
Y1 - 2014
N2 - Gene expression and transcriptome analysis are currently among the main research focuses for a great number of biologists. The assembly of raw sequence data into a draft transcriptome is a multi-stage pipeline consisting of complex steps such as data cleaning, contaminant removal, error correction, de novo assembly, redundancy removal, and assembly validation. In this paper, we develop and test five transcriptome assembly pipelines for the wheat transcriptome using different tools. The pipelines use the assemblers Velvet-Oases and Trinity, which employ multiple and single k-mer methods, respectively. In addition, different tools are used for data cleaning and redundancy removal. Analysis of the results from the five assemblies shows that the transcripts generated by Velvet-Oases with the multiple k-mer approach and Oases-M have a higher alignment ratio with the reference TriFLDB wheat transcriptome dataset.
AB - Gene expression and transcriptome analysis are currently among the main research focuses for a great number of biologists. The assembly of raw sequence data into a draft transcriptome is a multi-stage pipeline consisting of complex steps such as data cleaning, contaminant removal, error correction, de novo assembly, redundancy removal, and assembly validation. In this paper, we develop and test five transcriptome assembly pipelines for the wheat transcriptome using different tools. The pipelines use the assemblers Velvet-Oases and Trinity, which employ multiple and single k-mer methods, respectively. In addition, different tools are used for data cleaning and redundancy removal. Analysis of the results from the five assemblies shows that the transcripts generated by Velvet-Oases with the multiple k-mer approach and Oases-M have a higher alignment ratio with the reference TriFLDB wheat transcriptome dataset.
UR - http://www.scopus.com/inward/record.url?scp=84905827940&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905827940&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84905827940
SN - 9781632665140
T3 - Proceedings of the 6th International Conference on Bioinformatics and Computational Biology, BICOB 2014
SP - 163
EP - 168
BT - Proceedings of the 6th International Conference on Bioinformatics and Computational Biology, BICOB 2014
PB - International Society for Computers and Their Applications
T2 - 6th International Conference on Bioinformatics and Computational Biology, BICOB 2014
Y2 - 24 March 2014 through 26 March 2014
ER -