Systematic assessment of accuracy of comparative model of proteins belonging to different structural fold classes

Suvobrata Chakravarty, Dario Ghersi, Roberto Sanchez

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

In the absence of experimental structures, comparative modeling continues to be the chosen method for retrieving structural information on target proteins. However, models lack the accuracy of experimental structures. Alignment error and structural divergence (between target and template) influence model accuracy the most. Here, we examine the potential additional impact of backbone geometry, as our previous studies have suggested that the structural class (all-α, αβ, all-β) of a protein may influence the accuracy of its model. In the twilight zone (sequence identity ≤ 30%) and at a similar level of target-template divergence, the accuracy of protein models does indeed follow the trend all-α > αβ > all-β. This is mainly because the alignment accuracy follows the same trend (all-α > αβ > all-β), with backbone geometry playing only a minor role. Differences in the diversity of sequences belonging to different structural classes leads to the observed accuracy differences, thus enabling the accuracy of alignments/models to be estimated a priori in a class-dependent manner. This study provides a systematic description of and quantifies the structural class-dependent effect in comparative modeling. The study also suggests that datasets for large-scale sequence/structure analyses should have equal representations of different structural classes to avoid class-dependent bias.

Original languageEnglish (US)
Pages (from-to)2831-2837
Number of pages7
JournalJournal of Molecular Modeling
Volume17
Issue number11
DOIs
StatePublished - Nov 1 2011

Fingerprint

proteins
Proteins
alignment
divergence
Geometry
templates
trends
geometry

Keywords

  • Alignment accuracy
  • Homology modeling
  • Information content
  • Model accuracy
  • Secondary structure
  • Sequence alignment

ASJC Scopus subject areas

  • Catalysis
  • Computer Science Applications
  • Physical and Theoretical Chemistry
  • Organic Chemistry
  • Computational Theory and Mathematics
  • Inorganic Chemistry

Cite this

Systematic assessment of accuracy of comparative model of proteins belonging to different structural fold classes. / Chakravarty, Suvobrata; Ghersi, Dario; Sanchez, Roberto.

In: Journal of Molecular Modeling, Vol. 17, No. 11, 01.11.2011, p. 2831-2837.

Research output: Contribution to journalArticle

@article{6e680e9e95bb45a5a96fdb2b69434f25,
title = "Systematic assessment of accuracy of comparative model of proteins belonging to different structural fold classes",
abstract = "In the absence of experimental structures, comparative modeling continues to be the chosen method for retrieving structural information on target proteins. However, models lack the accuracy of experimental structures. Alignment error and structural divergence (between target and template) influence model accuracy the most. Here, we examine the potential additional impact of backbone geometry, as our previous studies have suggested that the structural class (all-α, αβ, all-β) of a protein may influence the accuracy of its model. In the twilight zone (sequence identity ≤ 30{\%}) and at a similar level of target-template divergence, the accuracy of protein models does indeed follow the trend all-α > αβ > all-β. This is mainly because the alignment accuracy follows the same trend (all-α > αβ > all-β), with backbone geometry playing only a minor role. Differences in the diversity of sequences belonging to different structural classes leads to the observed accuracy differences, thus enabling the accuracy of alignments/models to be estimated a priori in a class-dependent manner. This study provides a systematic description of and quantifies the structural class-dependent effect in comparative modeling. The study also suggests that datasets for large-scale sequence/structure analyses should have equal representations of different structural classes to avoid class-dependent bias.",
keywords = "Alignment accuracy, Homology modeling, Information content, Model accuracy, Secondary structure, Sequence alignment",
author = "Suvobrata Chakravarty and Dario Ghersi and Roberto Sanchez",
year = "2011",
month = "11",
day = "1",
doi = "10.1007/s00894-011-0976-9",
language = "English (US)",
volume = "17",
pages = "2831--2837",
journal = "Journal of Molecular Modeling",
issn = "1610-2940",
publisher = "Springer Verlag",
number = "11",

}

TY - JOUR

T1 - Systematic assessment of accuracy of comparative model of proteins belonging to different structural fold classes

AU - Chakravarty, Suvobrata

AU - Ghersi, Dario

AU - Sanchez, Roberto

PY - 2011/11/1

Y1 - 2011/11/1

N2 - In the absence of experimental structures, comparative modeling continues to be the chosen method for retrieving structural information on target proteins. However, models lack the accuracy of experimental structures. Alignment error and structural divergence (between target and template) influence model accuracy the most. Here, we examine the potential additional impact of backbone geometry, as our previous studies have suggested that the structural class (all-α, αβ, all-β) of a protein may influence the accuracy of its model. In the twilight zone (sequence identity ≤ 30%) and at a similar level of target-template divergence, the accuracy of protein models does indeed follow the trend all-α > αβ > all-β. This is mainly because the alignment accuracy follows the same trend (all-α > αβ > all-β), with backbone geometry playing only a minor role. Differences in the diversity of sequences belonging to different structural classes leads to the observed accuracy differences, thus enabling the accuracy of alignments/models to be estimated a priori in a class-dependent manner. This study provides a systematic description of and quantifies the structural class-dependent effect in comparative modeling. The study also suggests that datasets for large-scale sequence/structure analyses should have equal representations of different structural classes to avoid class-dependent bias.

AB - In the absence of experimental structures, comparative modeling continues to be the chosen method for retrieving structural information on target proteins. However, models lack the accuracy of experimental structures. Alignment error and structural divergence (between target and template) influence model accuracy the most. Here, we examine the potential additional impact of backbone geometry, as our previous studies have suggested that the structural class (all-α, αβ, all-β) of a protein may influence the accuracy of its model. In the twilight zone (sequence identity ≤ 30%) and at a similar level of target-template divergence, the accuracy of protein models does indeed follow the trend all-α > αβ > all-β. This is mainly because the alignment accuracy follows the same trend (all-α > αβ > all-β), with backbone geometry playing only a minor role. Differences in the diversity of sequences belonging to different structural classes leads to the observed accuracy differences, thus enabling the accuracy of alignments/models to be estimated a priori in a class-dependent manner. This study provides a systematic description of and quantifies the structural class-dependent effect in comparative modeling. The study also suggests that datasets for large-scale sequence/structure analyses should have equal representations of different structural classes to avoid class-dependent bias.

KW - Alignment accuracy

KW - Homology modeling

KW - Information content

KW - Model accuracy

KW - Secondary structure

KW - Sequence alignment

UR - http://www.scopus.com/inward/record.url?scp=80255140369&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80255140369&partnerID=8YFLogxK

U2 - 10.1007/s00894-011-0976-9

DO - 10.1007/s00894-011-0976-9

M3 - Article

C2 - 21301906

AN - SCOPUS:80255140369

VL - 17

SP - 2831

EP - 2837

JO - Journal of Molecular Modeling

JF - Journal of Molecular Modeling

SN - 1610-2940

IS - 11

ER -