Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency

William P. Bone, Nicole L. Washington, Orion J. Buske, David R. Adams, Joie Davis, David Draper, Elise D. Flynn, Marta Girdea, Rena Godfrey, Gretchen Golas, Catherine Groden, Julius Jacobsen, Sebastian Köhler, Elizabeth M.J. Lee, Amanda E. Links, Thomas C. Markello, Christopher J. Mungall, Michele Nehrebecky, Peter N. Robinson, Murat SincanAriane G. Soldatos, Cynthia J. Tifft, Camilo Toro, Heather Trang, Elise Valkanas, Nicole Vasilevsky, Colleen Wahl, Lynne A. Wolfe, Cornelius F. Boerkoel, Michael Brudno, Melissa A. Haendel, William A. Gahl, Damian Smedley

Research output: Contribution to journalArticle

39 Citations (Scopus)

Abstract

Purpose: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. Methods: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. Results: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease-gene associations and ranked the correct seeded variant in up to 87% when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. Conclusion: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.

Original languageEnglish (US)
Pages (from-to)608-617
Number of pages10
JournalGenetics in Medicine
Volume18
Issue number6
DOIs
StatePublished - Jun 1 2016

Fingerprint

Exome
Phenotype
Genes
Benchmarking
Inborn Genetic Diseases
National Institutes of Health (U.S.)
Rare Diseases
Proteins

Keywords

  • exome sequencing
  • model organisms
  • phenotype
  • semantic comparison
  • undiagnosed diseases

ASJC Scopus subject areas

  • Genetics(clinical)

Cite this

Bone, W. P., Washington, N. L., Buske, O. J., Adams, D. R., Davis, J., Draper, D., ... Smedley, D. (2016). Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency. Genetics in Medicine, 18(6), 608-617. https://doi.org/10.1038/gim.2015.137

Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency. / Bone, William P.; Washington, Nicole L.; Buske, Orion J.; Adams, David R.; Davis, Joie; Draper, David; Flynn, Elise D.; Girdea, Marta; Godfrey, Rena; Golas, Gretchen; Groden, Catherine; Jacobsen, Julius; Köhler, Sebastian; Lee, Elizabeth M.J.; Links, Amanda E.; Markello, Thomas C.; Mungall, Christopher J.; Nehrebecky, Michele; Robinson, Peter N.; Sincan, Murat; Soldatos, Ariane G.; Tifft, Cynthia J.; Toro, Camilo; Trang, Heather; Valkanas, Elise; Vasilevsky, Nicole; Wahl, Colleen; Wolfe, Lynne A.; Boerkoel, Cornelius F.; Brudno, Michael; Haendel, Melissa A.; Gahl, William A.; Smedley, Damian.

In: Genetics in Medicine, Vol. 18, No. 6, 01.06.2016, p. 608-617.

Research output: Contribution to journalArticle

Bone, WP, Washington, NL, Buske, OJ, Adams, DR, Davis, J, Draper, D, Flynn, ED, Girdea, M, Godfrey, R, Golas, G, Groden, C, Jacobsen, J, Köhler, S, Lee, EMJ, Links, AE, Markello, TC, Mungall, CJ, Nehrebecky, M, Robinson, PN, Sincan, M, Soldatos, AG, Tifft, CJ, Toro, C, Trang, H, Valkanas, E, Vasilevsky, N, Wahl, C, Wolfe, LA, Boerkoel, CF, Brudno, M, Haendel, MA, Gahl, WA & Smedley, D 2016, 'Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency', Genetics in Medicine, vol. 18, no. 6, pp. 608-617. https://doi.org/10.1038/gim.2015.137
Bone, William P. ; Washington, Nicole L. ; Buske, Orion J. ; Adams, David R. ; Davis, Joie ; Draper, David ; Flynn, Elise D. ; Girdea, Marta ; Godfrey, Rena ; Golas, Gretchen ; Groden, Catherine ; Jacobsen, Julius ; Köhler, Sebastian ; Lee, Elizabeth M.J. ; Links, Amanda E. ; Markello, Thomas C. ; Mungall, Christopher J. ; Nehrebecky, Michele ; Robinson, Peter N. ; Sincan, Murat ; Soldatos, Ariane G. ; Tifft, Cynthia J. ; Toro, Camilo ; Trang, Heather ; Valkanas, Elise ; Vasilevsky, Nicole ; Wahl, Colleen ; Wolfe, Lynne A. ; Boerkoel, Cornelius F. ; Brudno, Michael ; Haendel, Melissa A. ; Gahl, William A. ; Smedley, Damian. / Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency. In: Genetics in Medicine. 2016 ; Vol. 18, No. 6. pp. 608-617.
@article{fdf4227cf39c46b9a5f6b351ee251bb9,
title = "Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency",
abstract = "Purpose: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. Methods: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. Results: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97{\%} of known disease-gene associations and ranked the correct seeded variant in up to 87{\%} when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. Conclusion: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.",
keywords = "exome sequencing, model organisms, phenotype, semantic comparison, undiagnosed diseases",
author = "Bone, {William P.} and Washington, {Nicole L.} and Buske, {Orion J.} and Adams, {David R.} and Joie Davis and David Draper and Flynn, {Elise D.} and Marta Girdea and Rena Godfrey and Gretchen Golas and Catherine Groden and Julius Jacobsen and Sebastian K{\"o}hler and Lee, {Elizabeth M.J.} and Links, {Amanda E.} and Markello, {Thomas C.} and Mungall, {Christopher J.} and Michele Nehrebecky and Robinson, {Peter N.} and Murat Sincan and Soldatos, {Ariane G.} and Tifft, {Cynthia J.} and Camilo Toro and Heather Trang and Elise Valkanas and Nicole Vasilevsky and Colleen Wahl and Wolfe, {Lynne A.} and Boerkoel, {Cornelius F.} and Michael Brudno and Haendel, {Melissa A.} and Gahl, {William A.} and Damian Smedley",
year = "2016",
month = "6",
day = "1",
doi = "10.1038/gim.2015.137",
language = "English (US)",
volume = "18",
pages = "608--617",
journal = "Genetics in Medicine",
issn = "1098-3600",
publisher = "Lippincott Williams and Wilkins",
number = "6",

}

TY - JOUR

T1 - Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency

AU - Bone, William P.

AU - Washington, Nicole L.

AU - Buske, Orion J.

AU - Adams, David R.

AU - Davis, Joie

AU - Draper, David

AU - Flynn, Elise D.

AU - Girdea, Marta

AU - Godfrey, Rena

AU - Golas, Gretchen

AU - Groden, Catherine

AU - Jacobsen, Julius

AU - Köhler, Sebastian

AU - Lee, Elizabeth M.J.

AU - Links, Amanda E.

AU - Markello, Thomas C.

AU - Mungall, Christopher J.

AU - Nehrebecky, Michele

AU - Robinson, Peter N.

AU - Sincan, Murat

AU - Soldatos, Ariane G.

AU - Tifft, Cynthia J.

AU - Toro, Camilo

AU - Trang, Heather

AU - Valkanas, Elise

AU - Vasilevsky, Nicole

AU - Wahl, Colleen

AU - Wolfe, Lynne A.

AU - Boerkoel, Cornelius F.

AU - Brudno, Michael

AU - Haendel, Melissa A.

AU - Gahl, William A.

AU - Smedley, Damian

PY - 2016/6/1

Y1 - 2016/6/1

N2 - Purpose: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. Methods: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. Results: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease-gene associations and ranked the correct seeded variant in up to 87% when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. Conclusion: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.

AB - Purpose: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. Methods: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. Results: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease-gene associations and ranked the correct seeded variant in up to 87% when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. Conclusion: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.

KW - exome sequencing

KW - model organisms

KW - phenotype

KW - semantic comparison

KW - undiagnosed diseases

UR - http://www.scopus.com/inward/record.url?scp=84975063087&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84975063087&partnerID=8YFLogxK

U2 - 10.1038/gim.2015.137

DO - 10.1038/gim.2015.137

M3 - Article

C2 - 26562225

AN - SCOPUS:84975063087

VL - 18

SP - 608

EP - 617

JO - Genetics in Medicine

JF - Genetics in Medicine

SN - 1098-3600

IS - 6

ER -