Abstract
Background: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, "Clint", to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species.

Findings: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database.

Conclusions: We have provided a high-quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represents a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome.
Original language | English (US) |
---|---|
Article number | 18 |
Journal | GigaScience |
Volume | 4 |
Issue number | 1 |
State | Published - Apr 18 2015 |
Keywords
- Assembly
- Chimpanzee
- Pan troglodytes
- Transcriptome
- mRNA-seq
ASJC Scopus subject areas
- General Medicine