Metabolic reconstruction for metagenomic data and its application to the human microbiome

Sahar Abubucker, Nicola Segata, Johannes Goll, Alyxandria M. Schubert, Jacques Izard, Brandi L. Cantarel, Beltran Rodriguez-Mueller, Jeremy Zucker, Mathangi Thiagarajan, Bernard Henrissat, Owen White, Scott T. Kelley, Barbara Methé, Patrick D. Schloss, Dirk Gevers, Makedonka Mitreva, Curtis Huttenhower

Research output: Contribution to journalArticle

486 Citations (Scopus)

Abstract

Microbial communities carry out the majority of the biochemical activity on the planet, and they play integral roles in processes including metabolism and immune homeostasis in the human microbiome. Shotgun sequencing of such communities' metagenomes provides information complementary to organismal abundances from taxonomic markers, but the resulting data typically comprise short reads from hundreds of different organisms and are at best challenging to assemble comparably to single-organism genomes. Here, we describe an alternative approach to infer the functional and metabolic potential of a microbial community metagenome. We determined the gene families and pathways present or absent within a community, as well as their relative abundances, directly from short sequence reads. We validated this methodology using a collection of synthetic metagenomes, recovering the presence and abundance both of large pathways and of small functional modules with high accuracy. We subsequently applied this method, HUMAnN, to the microbial communities of 649 metagenomes drawn from seven primary body sites on 102 individuals as part of the Human Microbiome Project (HMP). This provided a means to compare functional diversity and organismal ecology in the human microbiome, and we determined a core of 24 ubiquitously present modules. Core pathways were often implemented by different enzyme families within different body sites, and 168 functional modules and 196 metabolic pathways varied in metagenomic abundance specifically to one or more niches within the microbiome. These included glycosaminoglycan degradation in the gut, as well as phosphate and amino acid transport linked to host phenotype (vaginal pH) in the posterior fornix. An implementation of our methodology is available at http://huttenhower.sph.harvard.edu/humann. This provides a means to accurately and efficiently characterize microbial metabolic pathways and functional modules directly from high-throughput sequencing reads, enabling the determination of community roles in the HMP cohort and in future metagenomic studies.

Original languageEnglish (US)
Article numbere1002358
JournalPLoS Computational Biology
Volume8
Issue number6
DOIs
StatePublished - Jun 1 2012

Fingerprint

Metagenomics
Metagenome
Microbiota
microbial community
Genes
Pathway
Planets
Ecology
Metabolism
microbial communities
Amino acids
Module
Metabolic Networks and Pathways
Phosphates
Enzymes
methodology
Throughput
homeostasis
Degradation
Sequencing

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Modeling and Simulation
  • Ecology
  • Molecular Biology
  • Genetics
  • Cellular and Molecular Neuroscience
  • Computational Theory and Mathematics

Cite this

Abubucker, S., Segata, N., Goll, J., Schubert, A. M., Izard, J., Cantarel, B. L., ... Huttenhower, C. (2012). Metabolic reconstruction for metagenomic data and its application to the human microbiome. PLoS Computational Biology, 8(6), [e1002358]. https://doi.org/10.1371/journal.pcbi.1002358

Metabolic reconstruction for metagenomic data and its application to the human microbiome. / Abubucker, Sahar; Segata, Nicola; Goll, Johannes; Schubert, Alyxandria M.; Izard, Jacques; Cantarel, Brandi L.; Rodriguez-Mueller, Beltran; Zucker, Jeremy; Thiagarajan, Mathangi; Henrissat, Bernard; White, Owen; Kelley, Scott T.; Methé, Barbara; Schloss, Patrick D.; Gevers, Dirk; Mitreva, Makedonka; Huttenhower, Curtis.

In: PLoS Computational Biology, Vol. 8, No. 6, e1002358, 01.06.2012.

Research output: Contribution to journalArticle

Abubucker, S, Segata, N, Goll, J, Schubert, AM, Izard, J, Cantarel, BL, Rodriguez-Mueller, B, Zucker, J, Thiagarajan, M, Henrissat, B, White, O, Kelley, ST, Methé, B, Schloss, PD, Gevers, D, Mitreva, M & Huttenhower, C 2012, 'Metabolic reconstruction for metagenomic data and its application to the human microbiome', PLoS Computational Biology, vol. 8, no. 6, e1002358. https://doi.org/10.1371/journal.pcbi.1002358
Abubucker, Sahar ; Segata, Nicola ; Goll, Johannes ; Schubert, Alyxandria M. ; Izard, Jacques ; Cantarel, Brandi L. ; Rodriguez-Mueller, Beltran ; Zucker, Jeremy ; Thiagarajan, Mathangi ; Henrissat, Bernard ; White, Owen ; Kelley, Scott T. ; Methé, Barbara ; Schloss, Patrick D. ; Gevers, Dirk ; Mitreva, Makedonka ; Huttenhower, Curtis. / Metabolic reconstruction for metagenomic data and its application to the human microbiome. In: PLoS Computational Biology. 2012 ; Vol. 8, No. 6.
@article{aae21d0eaeed44a5ae61197e88e71771,
title = "Metabolic reconstruction for metagenomic data and its application to the human microbiome",
abstract = "Microbial communities carry out the majority of the biochemical activity on the planet, and they play integral roles in processes including metabolism and immune homeostasis in the human microbiome. Shotgun sequencing of such communities' metagenomes provides information complementary to organismal abundances from taxonomic markers, but the resulting data typically comprise short reads from hundreds of different organisms and are at best challenging to assemble comparably to single-organism genomes. Here, we describe an alternative approach to infer the functional and metabolic potential of a microbial community metagenome. We determined the gene families and pathways present or absent within a community, as well as their relative abundances, directly from short sequence reads. We validated this methodology using a collection of synthetic metagenomes, recovering the presence and abundance both of large pathways and of small functional modules with high accuracy. We subsequently applied this method, HUMAnN, to the microbial communities of 649 metagenomes drawn from seven primary body sites on 102 individuals as part of the Human Microbiome Project (HMP). This provided a means to compare functional diversity and organismal ecology in the human microbiome, and we determined a core of 24 ubiquitously present modules. Core pathways were often implemented by different enzyme families within different body sites, and 168 functional modules and 196 metabolic pathways varied in metagenomic abundance specifically to one or more niches within the microbiome. These included glycosaminoglycan degradation in the gut, as well as phosphate and amino acid transport linked to host phenotype (vaginal pH) in the posterior fornix. An implementation of our methodology is available at http://huttenhower.sph.harvard.edu/humann. This provides a means to accurately and efficiently characterize microbial metabolic pathways and functional modules directly from high-throughput sequencing reads, enabling the determination of community roles in the HMP cohort and in future metagenomic studies.",
author = "Sahar Abubucker and Nicola Segata and Johannes Goll and Schubert, {Alyxandria M.} and Jacques Izard and Cantarel, {Brandi L.} and Beltran Rodriguez-Mueller and Jeremy Zucker and Mathangi Thiagarajan and Bernard Henrissat and Owen White and Kelley, {Scott T.} and Barbara Meth{\'e} and Schloss, {Patrick D.} and Dirk Gevers and Makedonka Mitreva and Curtis Huttenhower",
year = "2012",
month = "6",
day = "1",
doi = "10.1371/journal.pcbi.1002358",
language = "English (US)",
volume = "8",
journal = "PLoS Computational Biology",
issn = "1553-734X",
publisher = "Public Library of Science",
number = "6",

}

TY - JOUR

T1 - Metabolic reconstruction for metagenomic data and its application to the human microbiome

AU - Abubucker, Sahar

AU - Segata, Nicola

AU - Goll, Johannes

AU - Schubert, Alyxandria M.

AU - Izard, Jacques

AU - Cantarel, Brandi L.

AU - Rodriguez-Mueller, Beltran

AU - Zucker, Jeremy

AU - Thiagarajan, Mathangi

AU - Henrissat, Bernard

AU - White, Owen

AU - Kelley, Scott T.

AU - Methé, Barbara

AU - Schloss, Patrick D.

AU - Gevers, Dirk

AU - Mitreva, Makedonka

AU - Huttenhower, Curtis

PY - 2012/6/1

Y1 - 2012/6/1

N2 - Microbial communities carry out the majority of the biochemical activity on the planet, and they play integral roles in processes including metabolism and immune homeostasis in the human microbiome. Shotgun sequencing of such communities' metagenomes provides information complementary to organismal abundances from taxonomic markers, but the resulting data typically comprise short reads from hundreds of different organisms and are at best challenging to assemble comparably to single-organism genomes. Here, we describe an alternative approach to infer the functional and metabolic potential of a microbial community metagenome. We determined the gene families and pathways present or absent within a community, as well as their relative abundances, directly from short sequence reads. We validated this methodology using a collection of synthetic metagenomes, recovering the presence and abundance both of large pathways and of small functional modules with high accuracy. We subsequently applied this method, HUMAnN, to the microbial communities of 649 metagenomes drawn from seven primary body sites on 102 individuals as part of the Human Microbiome Project (HMP). This provided a means to compare functional diversity and organismal ecology in the human microbiome, and we determined a core of 24 ubiquitously present modules. Core pathways were often implemented by different enzyme families within different body sites, and 168 functional modules and 196 metabolic pathways varied in metagenomic abundance specifically to one or more niches within the microbiome. These included glycosaminoglycan degradation in the gut, as well as phosphate and amino acid transport linked to host phenotype (vaginal pH) in the posterior fornix. An implementation of our methodology is available at http://huttenhower.sph.harvard.edu/humann. This provides a means to accurately and efficiently characterize microbial metabolic pathways and functional modules directly from high-throughput sequencing reads, enabling the determination of community roles in the HMP cohort and in future metagenomic studies.

AB - Microbial communities carry out the majority of the biochemical activity on the planet, and they play integral roles in processes including metabolism and immune homeostasis in the human microbiome. Shotgun sequencing of such communities' metagenomes provides information complementary to organismal abundances from taxonomic markers, but the resulting data typically comprise short reads from hundreds of different organisms and are at best challenging to assemble comparably to single-organism genomes. Here, we describe an alternative approach to infer the functional and metabolic potential of a microbial community metagenome. We determined the gene families and pathways present or absent within a community, as well as their relative abundances, directly from short sequence reads. We validated this methodology using a collection of synthetic metagenomes, recovering the presence and abundance both of large pathways and of small functional modules with high accuracy. We subsequently applied this method, HUMAnN, to the microbial communities of 649 metagenomes drawn from seven primary body sites on 102 individuals as part of the Human Microbiome Project (HMP). This provided a means to compare functional diversity and organismal ecology in the human microbiome, and we determined a core of 24 ubiquitously present modules. Core pathways were often implemented by different enzyme families within different body sites, and 168 functional modules and 196 metabolic pathways varied in metagenomic abundance specifically to one or more niches within the microbiome. These included glycosaminoglycan degradation in the gut, as well as phosphate and amino acid transport linked to host phenotype (vaginal pH) in the posterior fornix. An implementation of our methodology is available at http://huttenhower.sph.harvard.edu/humann. This provides a means to accurately and efficiently characterize microbial metabolic pathways and functional modules directly from high-throughput sequencing reads, enabling the determination of community roles in the HMP cohort and in future metagenomic studies.

UR - http://www.scopus.com/inward/record.url?scp=84864037467&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84864037467&partnerID=8YFLogxK

U2 - 10.1371/journal.pcbi.1002358

DO - 10.1371/journal.pcbi.1002358

M3 - Article

VL - 8

JO - PLoS Computational Biology

JF - PLoS Computational Biology

SN - 1553-734X

IS - 6

M1 - e1002358

ER -