TY - GEN
T1 - Comparing and optimizing transcriptome assembly pipeline for diploid wheat
AU - Pavlovikj, Natasha
AU - Begcy, Kevin
AU - Behera, Sairam
AU - Campbell, Malachy
AU - Walia, Harkamal
AU - Deogun, Jitender S.
N1 - Publisher Copyright:
Copyright © 2014 ACM.
PY - 2014/9/20
Y1 - 2014/9/20
N2 - Gene expression and transcriptome analysis are currently one of the main focuses of research for a great number of scientists. However, the assembly of raw sequence data to obtain a draft transcriptome of an organism is a complex multi-stage process usually composed of preprocessing, assembling, and postprocessing. Each of these stages includes multiple steps such as data cleaning, contaminant removal, error correction and assembly validation. In order to implement all these steps, a great knowledge of different algorithms, various bioinformatics tools and software is required. In this paper, we generate multiple transcriptome assembly pipelines by using different tools and approaches in the process. Analyzing these pipelines, we can observe that using the error correction method with Velvet Oases and merging the individual k-mer assemblies with highest N50 produce the most stable base for further transcriptome biological analysis.
AB - Gene expression and transcriptome analysis are currently one of the main focuses of research for a great number of scientists. However, the assembly of raw sequence data to obtain a draft transcriptome of an organism is a complex multi-stage process usually composed of preprocessing, assembling, and postprocessing. Each of these stages includes multiple steps such as data cleaning, contaminant removal, error correction and assembly validation. In order to implement all these steps, a great knowledge of different algorithms, various bioinformatics tools and software is required. In this paper, we generate multiple transcriptome assembly pipelines by using different tools and approaches in the process. Analyzing these pipelines, we can observe that using the error correction method with Velvet Oases and merging the individual k-mer assemblies with highest N50 produce the most stable base for further transcriptome biological analysis.
UR - http://www.scopus.com/inward/record.url?scp=84920733670&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84920733670&partnerID=8YFLogxK
U2 - 10.1145/2649387.2662450
DO - 10.1145/2649387.2662450
M3 - Conference contribution
AN - SCOPUS:84920733670
T3 - ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
SP - 603
EP - 604
BT - ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PB - Association for Computing Machinery
T2 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014
Y2 - 20 September 2014 through 23 September 2014
ER -