Quality and quantity of data recovered from massively parallel sequencing: Examples in asparagales and Poaceae

P. Roxanne Steele, Kate L. Hertweck, Dustin Mayfield, Michael R. McKain, James Leebens-Mack, J. Chris Pires

Research output: Contribution to journalArticle

66 Citations (Scopus)

Abstract

Premise of the study: Genome survey sequences (GSS) from massively parallel sequencing have potential to provide large, cost-effective data sets for phylogenetic inference, replace single gene or spacer regions as DNA barcodes, and provide a plethora of data for other comparative molecular evolution studies. Here we report on the application of this method to estimating the molecular phylogeny of core Asparagales, investigating plastid gene losses, assembling complete plastid genomes, and determining the type and quality of assembled genomic data attainable from Illumina 80-120-bp reads. Methods: We sequenced total genomic DNA from samples in two lineages of monocotyledonous plants, Poaceae and Asparagales, on the Illumina platform in a multiplex arrangement. We compared reference-based assemblies to de novo contigs, evaluated consistency of assemblies resulting from use of various references sequences, and assessed our methods to obtain sequence assemblies in nonmodel taxa. Key results: Our method returned reliable, robust organellar and nrDNA sequences in a variety of plant lineages. High quality assemblies are not dependent on genome size, amount of plastid present in the total genomic DNA template, or relatedness of available reference sequences for assembly. Phylogenetic results revealed familial and subfamilial relationships within Asparagales with high bootstrap support, although placement of the monotypic genus Aphyllanthes was placed with moderate confidence. Conclusions: The well-supported molecular phylogeny provides evidence for delineation of subfamilies within core Asparagales. With advances in technology and bioinformatics tools, the use of massively parallel sequencing will continue to become easier and more affordable for phylogenomic and molecular evolutionary biology investigations.

Original languageEnglish (US)
Pages (from-to)330-348
Number of pages19
JournalAmerican Journal of Botany
Volume99
Issue number2
DOIs
StatePublished - Feb 1 2012

Fingerprint

Asparagales
High-Throughput Nucleotide Sequencing
Poaceae
plastid
genomics
Plastids
genome
phylogeny
Phylogeny
DNA
plastids
Plastid Genomes
phylogenetics
Genome Size
bioinformatics
Molecular Evolution
gene
DNA barcoding
evolutionary biology
Computational Biology

Keywords

  • Angiosperms
  • Aphyllanthes
  • Chloroplast
  • DNA barcoding
  • Mitochondria
  • Next-generation sequencing
  • Nuclear ribosomal DNA
  • Phylogenetics
  • Plastid

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Genetics
  • Plant Science

Cite this

Quality and quantity of data recovered from massively parallel sequencing : Examples in asparagales and Poaceae. / Roxanne Steele, P.; Hertweck, Kate L.; Mayfield, Dustin; McKain, Michael R.; Leebens-Mack, James; Chris Pires, J.

In: American Journal of Botany, Vol. 99, No. 2, 01.02.2012, p. 330-348.

Research output: Contribution to journalArticle

Roxanne Steele, P. ; Hertweck, Kate L. ; Mayfield, Dustin ; McKain, Michael R. ; Leebens-Mack, James ; Chris Pires, J. / Quality and quantity of data recovered from massively parallel sequencing : Examples in asparagales and Poaceae. In: American Journal of Botany. 2012 ; Vol. 99, No. 2. pp. 330-348.
@article{30b554041ded476ab47596766e6d13fa,
title = "Quality and quantity of data recovered from massively parallel sequencing: Examples in asparagales and Poaceae",
abstract = "Premise of the study: Genome survey sequences (GSS) from massively parallel sequencing have potential to provide large, cost-effective data sets for phylogenetic inference, replace single gene or spacer regions as DNA barcodes, and provide a plethora of data for other comparative molecular evolution studies. Here we report on the application of this method to estimating the molecular phylogeny of core Asparagales, investigating plastid gene losses, assembling complete plastid genomes, and determining the type and quality of assembled genomic data attainable from Illumina 80-120-bp reads. Methods: We sequenced total genomic DNA from samples in two lineages of monocotyledonous plants, Poaceae and Asparagales, on the Illumina platform in a multiplex arrangement. We compared reference-based assemblies to de novo contigs, evaluated consistency of assemblies resulting from use of various references sequences, and assessed our methods to obtain sequence assemblies in nonmodel taxa. Key results: Our method returned reliable, robust organellar and nrDNA sequences in a variety of plant lineages. High quality assemblies are not dependent on genome size, amount of plastid present in the total genomic DNA template, or relatedness of available reference sequences for assembly. Phylogenetic results revealed familial and subfamilial relationships within Asparagales with high bootstrap support, although placement of the monotypic genus Aphyllanthes was placed with moderate confidence. Conclusions: The well-supported molecular phylogeny provides evidence for delineation of subfamilies within core Asparagales. With advances in technology and bioinformatics tools, the use of massively parallel sequencing will continue to become easier and more affordable for phylogenomic and molecular evolutionary biology investigations.",
keywords = "Angiosperms, Aphyllanthes, Chloroplast, DNA barcoding, Mitochondria, Next-generation sequencing, Nuclear ribosomal DNA, Phylogenetics, Plastid",
author = "{Roxanne Steele}, P. and Hertweck, {Kate L.} and Dustin Mayfield and McKain, {Michael R.} and James Leebens-Mack and {Chris Pires}, J.",
year = "2012",
month = "2",
day = "1",
doi = "10.3732/ajb.1100491",
language = "English (US)",
volume = "99",
pages = "330--348",
journal = "American Journal of Botany",
issn = "0002-9122",
publisher = "Botanical Society of America Inc.",
number = "2",

}

TY - JOUR

T1 - Quality and quantity of data recovered from massively parallel sequencing

T2 - Examples in asparagales and Poaceae

AU - Roxanne Steele, P.

AU - Hertweck, Kate L.

AU - Mayfield, Dustin

AU - McKain, Michael R.

AU - Leebens-Mack, James

AU - Chris Pires, J.

PY - 2012/2/1

Y1 - 2012/2/1

N2 - Premise of the study: Genome survey sequences (GSS) from massively parallel sequencing have potential to provide large, cost-effective data sets for phylogenetic inference, replace single gene or spacer regions as DNA barcodes, and provide a plethora of data for other comparative molecular evolution studies. Here we report on the application of this method to estimating the molecular phylogeny of core Asparagales, investigating plastid gene losses, assembling complete plastid genomes, and determining the type and quality of assembled genomic data attainable from Illumina 80-120-bp reads. Methods: We sequenced total genomic DNA from samples in two lineages of monocotyledonous plants, Poaceae and Asparagales, on the Illumina platform in a multiplex arrangement. We compared reference-based assemblies to de novo contigs, evaluated consistency of assemblies resulting from use of various references sequences, and assessed our methods to obtain sequence assemblies in nonmodel taxa. Key results: Our method returned reliable, robust organellar and nrDNA sequences in a variety of plant lineages. High quality assemblies are not dependent on genome size, amount of plastid present in the total genomic DNA template, or relatedness of available reference sequences for assembly. Phylogenetic results revealed familial and subfamilial relationships within Asparagales with high bootstrap support, although placement of the monotypic genus Aphyllanthes was placed with moderate confidence. Conclusions: The well-supported molecular phylogeny provides evidence for delineation of subfamilies within core Asparagales. With advances in technology and bioinformatics tools, the use of massively parallel sequencing will continue to become easier and more affordable for phylogenomic and molecular evolutionary biology investigations.

AB - Premise of the study: Genome survey sequences (GSS) from massively parallel sequencing have potential to provide large, cost-effective data sets for phylogenetic inference, replace single gene or spacer regions as DNA barcodes, and provide a plethora of data for other comparative molecular evolution studies. Here we report on the application of this method to estimating the molecular phylogeny of core Asparagales, investigating plastid gene losses, assembling complete plastid genomes, and determining the type and quality of assembled genomic data attainable from Illumina 80-120-bp reads. Methods: We sequenced total genomic DNA from samples in two lineages of monocotyledonous plants, Poaceae and Asparagales, on the Illumina platform in a multiplex arrangement. We compared reference-based assemblies to de novo contigs, evaluated consistency of assemblies resulting from use of various references sequences, and assessed our methods to obtain sequence assemblies in nonmodel taxa. Key results: Our method returned reliable, robust organellar and nrDNA sequences in a variety of plant lineages. High quality assemblies are not dependent on genome size, amount of plastid present in the total genomic DNA template, or relatedness of available reference sequences for assembly. Phylogenetic results revealed familial and subfamilial relationships within Asparagales with high bootstrap support, although placement of the monotypic genus Aphyllanthes was placed with moderate confidence. Conclusions: The well-supported molecular phylogeny provides evidence for delineation of subfamilies within core Asparagales. With advances in technology and bioinformatics tools, the use of massively parallel sequencing will continue to become easier and more affordable for phylogenomic and molecular evolutionary biology investigations.

KW - Angiosperms

KW - Aphyllanthes

KW - Chloroplast

KW - DNA barcoding

KW - Mitochondria

KW - Next-generation sequencing

KW - Nuclear ribosomal DNA

KW - Phylogenetics

KW - Plastid

UR - http://www.scopus.com/inward/record.url?scp=84858591050&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84858591050&partnerID=8YFLogxK

U2 - 10.3732/ajb.1100491

DO - 10.3732/ajb.1100491

M3 - Article

C2 - 22291168

AN - SCOPUS:84858591050

VL - 99

SP - 330

EP - 348

JO - American Journal of Botany

JF - American Journal of Botany

SN - 0002-9122

IS - 2

ER -