De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences

Mnirnal D. Maudhoo, Jacob D. Madison, Robert B Norgren

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Background: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, " Clint" , to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species.Findings: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database.Conclusions: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome.

Original languageEnglish (US)
Article number18
JournalGigaScience
Volume4
Issue number1
DOIs
StatePublished - Apr 18 2015

Fingerprint

Pan troglodytes
Transcriptome
Genes
Messenger RNA
Pan paniscus
Biotechnology
RNA
Muscle
Information Centers
Nucleotides
Genome
Skin
Firearms
Tissue
Proteins
Vascular Smooth Muscle
Adipose Tissue
Cultured Cells
Skeletal Muscle
Databases

Keywords

  • Assembly
  • Chimpanzee
  • Pan troglodytes
  • Transcriptome
  • mRNA-seq

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this

De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences. / Maudhoo, Mnirnal D.; Madison, Jacob D.; Norgren, Robert B.

In: GigaScience, Vol. 4, No. 1, 18, 18.04.2015.

Research output: Contribution to journalArticle

Maudhoo, Mnirnal D. ; Madison, Jacob D. ; Norgren, Robert B. / De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences. In: GigaScience. 2015 ; Vol. 4, No. 1.
@article{9d6b66e8ac8a45deb36398d649aa69bb,
title = "De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences",
abstract = "Background: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, {"} Clint{"} , to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species.Findings: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database.Conclusions: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome.",
keywords = "Assembly, Chimpanzee, Pan troglodytes, Transcriptome, mRNA-seq",
author = "Maudhoo, {Mnirnal D.} and Madison, {Jacob D.} and Norgren, {Robert B}",
year = "2015",
month = "4",
day = "18",
doi = "10.1186/s13742-015-0061-x",
language = "English (US)",
volume = "4",
journal = "GigaScience",
issn = "2047-217X",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences

AU - Maudhoo, Mnirnal D.

AU - Madison, Jacob D.

AU - Norgren, Robert B

PY - 2015/4/18

Y1 - 2015/4/18

N2 - Background: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, " Clint" , to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species.Findings: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database.Conclusions: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome.

AB - Background: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, " Clint" , to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species.Findings: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database.Conclusions: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome.

KW - Assembly

KW - Chimpanzee

KW - Pan troglodytes

KW - Transcriptome

KW - mRNA-seq

UR - http://www.scopus.com/inward/record.url?scp=84945497664&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84945497664&partnerID=8YFLogxK

U2 - 10.1186/s13742-015-0061-x

DO - 10.1186/s13742-015-0061-x

M3 - Article

C2 - 25897398

AN - SCOPUS:84945497664

VL - 4

JO - GigaScience

JF - GigaScience

SN - 2047-217X

IS - 1

M1 - 18

ER -