Advantages of an improved rhesus macaque genome for evolutionary analyses

Julien S. Gradnigo, Abhishek Majumdar, Robert B Norgren, Etsuko Moriyama

Research output: Contribution to journal, Article

1 Citation (Scopus)

Abstract

The rhesus macaque (Macaca mulatta) is widely used in molecular evolutionary analyses, particularly to identify genes under adaptive or unique evolution in the human lineage. For such studies, it is necessary to align nucleotide sequences of homologous protein-coding genes among multiple species. The validity of these analyses is dependent on high quality genomic data. However, for most mammalian species (other than humans and mice), only draft genomes are available. There has been concern that some results obtained from evolutionary analyses using draft genomes may not be correct. The rhesus macaque provides a unique opportunity to determine whether an improved genome (MacaM) yields better results than a draft genome (rheMac2) for evolutionary studies. We compared protein-coding genes annotated in the rheMac2 and MacaM genomes with their human orthologs. We found many genes annotated in rheMac2 had apparently spurious sequences not present in genes derived from MacaM. The rheMac2 annotations also appeared to inflate a frequently used evolutionary index, ω (the ratio of nonsynonymous to synonymous substitution rates). Genes with these spurious sequences must be filtered out from evolutionary analyses to obtain correct results. With the MacaM genome, improved sequence information means many more genes can be examined for indications of selection. These results indicate how upgrading genomes from draft status to a higher level of quality can improve interpretation of evolutionary patterns.
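
For readers unfamiliar with the index named in the abstract, ω can be written out as below. The symbols d_N and d_S (nonsynonymous and synonymous substitution rates per site) are conventional notation assumed here, not taken from this record, and the three-way interpretation is the standard textbook reading rather than a result of the paper.

```latex
\[
\omega = \frac{d_N}{d_S},
\qquad
\begin{cases}
\omega < 1 & \text{purifying (negative) selection} \\
\omega = 1 & \text{neutral evolution} \\
\omega > 1 & \text{positive (adaptive) selection}
\end{cases}
\]
```

On this reading, spurious sequence in a gene annotation would be expected to add apparent nonsynonymous differences relative to the human ortholog and so raise ω, which is consistent with the inflation the abstract reports for rheMac2.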

Original language: English (US)
Article number: e0167376
Journal: PLoS One
Volume: 11
Issue number: 12
DOI: 10.1371/journal.pone.0167376
State: Published - Dec 2016

Fingerprint

Macaca mulatta
Genes
Genome
Proteins
genomics
nucleotide sequences
mice
Substitution reactions
Nucleotides

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

Advantages of an improved rhesus macaque genome for evolutionary analyses. / Gradnigo, Julien S.; Majumdar, Abhishek; Norgren, Robert B; Moriyama, Etsuko.

In: PLoS One, Vol. 11, No. 12, e0167376, 12.2016.

@article{27fac9a8ab7345f191a7de92d492c9cc,
title = "Advantages of an improved rhesus macaque genome for evolutionary analyses",
abstract = "The rhesus macaque (Macaca mulatta) is widely used in molecular evolutionary analyses, particularly to identify genes under adaptive or unique evolution in the human lineage. For such studies, it is necessary to align nucleotide sequences of homologous protein-coding genes among multiple species. The validity of these analyses is dependent on high quality genomic data. However, for most mammalian species (other than humans and mice), only draft genomes are available. There has been concern that some results obtained from evolutionary analyses using draft genomes may not be correct. The rhesus macaque provides a unique opportunity to determine whether an improved genome (MacaM) yields better results than a draft genome (rheMac2) for evolutionary studies. We compared protein-coding genes annotated in the rheMac2 and MacaM genomes with their human orthologs. We found many genes annotated in rheMac2 had apparently spurious sequences not present in genes derived from MacaM. The rheMac2 annotations also appeared to inflate a frequently used evolutionary index, ω (the ratio of nonsynonymous to synonymous substitution rates). Genes with these spurious sequences must be filtered out from evolutionary analyses to obtain correct results. With the MacaM genome, improved sequence information means many more genes can be examined for indications of selection. These results indicate how upgrading genomes from draft status to a higher level of quality can improve interpretation of evolutionary patterns.",
author = "Gradnigo, {Julien S.} and Abhishek Majumdar and Norgren, {Robert B} and Etsuko Moriyama",
year = "2016",
month = "12",
doi = "10.1371/journal.pone.0167376",
language = "English (US)",
volume = "11",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "12",
pages = "e0167376",
}

TY - JOUR
T1 - Advantages of an improved rhesus macaque genome for evolutionary analyses
AU - Gradnigo, Julien S.
AU - Majumdar, Abhishek
AU - Norgren, Robert B
AU - Moriyama, Etsuko
PY - 2016/12
Y1 - 2016/12
AB - The rhesus macaque (Macaca mulatta) is widely used in molecular evolutionary analyses, particularly to identify genes under adaptive or unique evolution in the human lineage. For such studies, it is necessary to align nucleotide sequences of homologous protein-coding genes among multiple species. The validity of these analyses is dependent on high quality genomic data. However, for most mammalian species (other than humans and mice), only draft genomes are available. There has been concern that some results obtained from evolutionary analyses using draft genomes may not be correct. The rhesus macaque provides a unique opportunity to determine whether an improved genome (MacaM) yields better results than a draft genome (rheMac2) for evolutionary studies. We compared protein-coding genes annotated in the rheMac2 and MacaM genomes with their human orthologs. We found many genes annotated in rheMac2 had apparently spurious sequences not present in genes derived from MacaM. The rheMac2 annotations also appeared to inflate a frequently used evolutionary index, ω (the ratio of nonsynonymous to synonymous substitution rates). Genes with these spurious sequences must be filtered out from evolutionary analyses to obtain correct results. With the MacaM genome, improved sequence information means many more genes can be examined for indications of selection. These results indicate how upgrading genomes from draft status to a higher level of quality can improve interpretation of evolutionary patterns.
UR - http://www.scopus.com/inward/record.url?scp=85002428393&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85002428393&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0167376
DO - 10.1371/journal.pone.0167376
M3 - Article
C2 - 27911958
AN - SCOPUS:85002428393
VL - 11
JO - PLoS One
JF - PLoS One
SN - 1932-6203
IS - 12
M1 - e0167376
ER -