Evolutionary dynamics of influenza A nucleoprotein (NP) lineages revealed by large-scale sequence analyses

Jianpeng Xu, Mary C. Christman, Ruben O. Donis, Guoqing Lu

Research output: Contribution to journal › Article

14 Citations (Scopus)

Abstract

Influenza A viral nucleoprotein (NP) plays a critical role in virus replication and host adaptation; however, the underlying molecular evolutionary dynamics of NP lineages are less well understood. In this study, large-scale analyses of 5094 NP nucleotide sequences revealed eight distinct evolutionary lineages, including three host-specific lineages (human, classical swine and equine), two cross-host lineages (Eurasian avian-like swine and swine-origin human pandemic H1N1 2009) and three geographically isolated avian lineages (Eurasian, North American and Oceanian). The average nucleotide substitution rate of the NP lineages was estimated to be 2.4×10⁻³ substitutions per site per year, with the highest value observed in pandemic H1N1 2009 (3.4×10⁻³) and the lowest in equine (0.9×10⁻³). The estimated time of most recent common ancestor (TMRCA) for each lineage demonstrated that the earliest human lineage was derived around 1906, and the latest pandemic H1N1 2009 lineage dated back to December 17, 2008. A marked time gap was found between the times when the viruses emerged and were first sampled, suggesting the crucial role for long-term surveillance of newly emerging viruses. The selection analyses showed that the human lineage had six positive selection sites, whereas pandemic H1N1 2009, classical swine, Eurasian avian and Eurasian swine had only one or two sites. Protein structure analyses revealed several positive selection sites located in epitope regions or host adaptation regions, indicating strong adaptation to host immune system pressures in influenza viruses. Along with previous studies, this study provides new insights into the evolutionary dynamics of influenza A NP lineages. Further lineage analyses of other gene segments will allow better understanding of influenza A virus evolution and assist in the improvement of global influenza surveillance.

Original language: English (US)
Pages (from-to): 2125-2132
Number of pages: 8
Journal: Infection, Genetics and Evolution
Volume: 11
Issue number: 8
DOIs: 10.1016/j.meegid.2011.07.002
State: Published - Dec 1 2011

Fingerprint

nucleoproteins
influenza
Nucleoproteins
Human Influenza
Sequence Analysis
Pandemics
pandemic
virus
Swine
swine
site selection
Horses
substitution
Viruses
horses
viruses
monitoring
Influenza A virus
immune system
common ancestry

Keywords

  • Influenza
  • Lineage
  • Nucleoprotein (NP)
  • Selection
  • Substitution rate
  • TMRCA

ASJC Scopus subject areas

  • Microbiology
  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics
  • Microbiology (medical)
  • Infectious Diseases

Cite this

Evolutionary dynamics of influenza A nucleoprotein (NP) lineages revealed by large-scale sequence analyses. / Xu, Jianpeng; Christman, Mary C.; Donis, Ruben O.; Lu, Guoqing.

In: Infection, Genetics and Evolution, Vol. 11, No. 8, 01.12.2011, p. 2125-2132.

@article{b59e72a0507547af80ebf44dfa2e99f5,
title = "Evolutionary dynamics of influenza A nucleoprotein (NP) lineages revealed by large-scale sequence analyses",
abstract = "Influenza A viral nucleoprotein (NP) plays a critical role in virus replication and host adaptation; however, the underlying molecular evolutionary dynamics of NP lineages are less well understood. In this study, large-scale analyses of 5094 NP nucleotide sequences revealed eight distinct evolutionary lineages, including three host-specific lineages (human, classical swine and equine), two cross-host lineages (Eurasian avian-like swine and swine-origin human pandemic H1N1 2009) and three geographically isolated avian lineages (Eurasian, North American and Oceanian). The average nucleotide substitution rate of the NP lineages was estimated to be 2.4×10$^{-3}$ substitutions per site per year, with the highest value observed in pandemic H1N1 2009 (3.4×10$^{-3}$) and the lowest in equine (0.9×10$^{-3}$). The estimated time of most recent common ancestor (TMRCA) for each lineage demonstrated that the earliest human lineage was derived around 1906, and the latest pandemic H1N1 2009 lineage dated back to December 17, 2008. A marked time gap was found between the times when the viruses emerged and were first sampled, suggesting the crucial role for long-term surveillance of newly emerging viruses. The selection analyses showed that the human lineage had six positive selection sites, whereas pandemic H1N1 2009, classical swine, Eurasian avian and Eurasian swine had only one or two sites. Protein structure analyses revealed several positive selection sites located in epitope regions or host adaptation regions, indicating strong adaptation to host immune system pressures in influenza viruses. Along with previous studies, this study provides new insights into the evolutionary dynamics of influenza A NP lineages. Further lineage analyses of other gene segments will allow better understanding of influenza A virus evolution and assist in the improvement of global influenza surveillance.",
keywords = "Influenza, Lineage, Nucleoprotein (NP), Selection, Substitution rate, TMRCA",
author = "Jianpeng Xu and Christman, {Mary C.} and Donis, {Ruben O.} and Guoqing Lu",
year = "2011",
month = "12",
day = "1",
doi = "10.1016/j.meegid.2011.07.002",
language = "English (US)",
volume = "11",
pages = "2125--2132",
journal = "Infection, Genetics and Evolution",
issn = "1567-1348",
publisher = "Elsevier",
number = "8",

}

TY - JOUR

T1 - Evolutionary dynamics of influenza A nucleoprotein (NP) lineages revealed by large-scale sequence analyses

AU - Xu, Jianpeng

AU - Christman, Mary C.

AU - Donis, Ruben O.

AU - Lu, Guoqing

PY - 2011/12/1

Y1 - 2011/12/1

AB - Influenza A viral nucleoprotein (NP) plays a critical role in virus replication and host adaptation; however, the underlying molecular evolutionary dynamics of NP lineages are less well understood. In this study, large-scale analyses of 5094 NP nucleotide sequences revealed eight distinct evolutionary lineages, including three host-specific lineages (human, classical swine and equine), two cross-host lineages (Eurasian avian-like swine and swine-origin human pandemic H1N1 2009) and three geographically isolated avian lineages (Eurasian, North American and Oceanian). The average nucleotide substitution rate of the NP lineages was estimated to be 2.4×10⁻³ substitutions per site per year, with the highest value observed in pandemic H1N1 2009 (3.4×10⁻³) and the lowest in equine (0.9×10⁻³). The estimated time of most recent common ancestor (TMRCA) for each lineage demonstrated that the earliest human lineage was derived around 1906, and the latest pandemic H1N1 2009 lineage dated back to December 17, 2008. A marked time gap was found between the times when the viruses emerged and were first sampled, suggesting the crucial role for long-term surveillance of newly emerging viruses. The selection analyses showed that the human lineage had six positive selection sites, whereas pandemic H1N1 2009, classical swine, Eurasian avian and Eurasian swine had only one or two sites. Protein structure analyses revealed several positive selection sites located in epitope regions or host adaptation regions, indicating strong adaptation to host immune system pressures in influenza viruses. Along with previous studies, this study provides new insights into the evolutionary dynamics of influenza A NP lineages. Further lineage analyses of other gene segments will allow better understanding of influenza A virus evolution and assist in the improvement of global influenza surveillance.

KW - Influenza

KW - Lineage

KW - Nucleoprotein (NP)

KW - Selection

KW - Substitution rate

KW - TMRCA

UR - http://www.scopus.com/inward/record.url?scp=82655181926&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=82655181926&partnerID=8YFLogxK

U2 - 10.1016/j.meegid.2011.07.002

DO - 10.1016/j.meegid.2011.07.002

M3 - Article

C2 - 21763464

AN - SCOPUS:82655181926

VL - 11

SP - 2125

EP - 2132

JO - Infection, Genetics and Evolution

JF - Infection, Genetics and Evolution

SN - 1567-1348

IS - 8

ER -