The dependence of all-atom statistical potentials on structural training database

Chi Zhang, Song Liu, Hongyi Zhou, Yaoqi Zhou

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

An accurate statistical energy function that is suitable for the prediction of protein structures of all classes should be independent of the structural database used for energy extraction. Here, two high-resolution, low-sequence-identity structural databases of 333 α-proteins and 271 β-proteins were built for examining the database dependence of three all-atom statistical energy functions. They are RAPDF (residue-specific all-atom conditional probability discriminatory function), atomic KBP (atomic knowledge-based potential), and DFIRE (statistical potential based on distance-scaled finite ideal-gas reference state). These energy functions differ in the reference states used for energy derivation. The energy functions extracted from the different structural databases are used to select native structures from multiple decoys of 64 α-proteins and 28 β-proteins. The performance in native structure selections indicates that the DFIRE-based energy function is mostly independent of the structural database whereas RAPDF and KBP have a significant dependence. The construction of two additional structural databases of α/β and α + β-proteins further confirmed the weak dependence of DFIRE on the structural databases of various structural classes. The possible source for the difference between the three all-atom statistical energy functions is that the physical reference state of ideal gas used in the DFIRE-based energy function is least dependent on the structural database.

Original languageEnglish (US)
Pages (from-to)3349-3358
Number of pages10
JournalBiophysical journal
Volume86
Issue number6
DOIs
StatePublished - Jan 1 2004

Fingerprint

Databases
Protein Databases
Proteins
Gases

ASJC Scopus subject areas

  • Biophysics

Cite this

The dependence of all-atom statistical potentials on structural training database. / Zhang, Chi; Liu, Song; Zhou, Hongyi; Zhou, Yaoqi.

In: Biophysical journal, Vol. 86, No. 6, 01.01.2004, p. 3349-3358.

Research output: Contribution to journalArticle

Zhang, Chi ; Liu, Song ; Zhou, Hongyi ; Zhou, Yaoqi. / The dependence of all-atom statistical potentials on structural training database. In: Biophysical journal. 2004 ; Vol. 86, No. 6. pp. 3349-3358.
@article{f86dbc3c9d864deeb940cc87b505290e,
title = "The dependence of all-atom statistical potentials on structural training database",
abstract = "An accurate statistical energy function that is suitable for the prediction of protein structures of all classes should be independent of the structural database used for energy extraction. Here, two high-resolution, low-sequence-identity structural databases of 333 α-proteins and 271 β-proteins were built for examining the database dependence of three all-atom statistical energy functions. They are RAPDF (residue-specific all-atom conditional probability discriminatory function), atomic KBP (atomic knowledge-based potential), and DFIRE (statistical potential based on distance-scaled finite ideal-gas reference state). These energy functions differ in the reference states used for energy derivation. The energy functions extracted from the different structural databases are used to select native structures from multiple decoys of 64 α-proteins and 28 β-proteins. The performance in native structure selections indicates that the DFIRE-based energy function is mostly independent of the structural database whereas RAPDF and KBP have a significant dependence. The construction of two additional structural databases of α/β and α + β-proteins further confirmed the weak dependence of DFIRE on the structural databases of various structural classes. The possible source for the difference between the three all-atom statistical energy functions is that the physical reference state of ideal gas used in the DFIRE-based energy function is least dependent on the structural database.",
author = "Chi Zhang and Song Liu and Hongyi Zhou and Yaoqi Zhou",
year = "2004",
month = "1",
day = "1",
doi = "10.1529/biophysj.103.035998",
language = "English (US)",
volume = "86",
pages = "3349--3358",
journal = "Biophysical Journal",
issn = "0006-3495",
publisher = "Biophysical Society",
number = "6",

}

TY - JOUR

T1 - The dependence of all-atom statistical potentials on structural training database

AU - Zhang, Chi

AU - Liu, Song

AU - Zhou, Hongyi

AU - Zhou, Yaoqi

PY - 2004/1/1

Y1 - 2004/1/1

N2 - An accurate statistical energy function that is suitable for the prediction of protein structures of all classes should be independent of the structural database used for energy extraction. Here, two high-resolution, low-sequence-identity structural databases of 333 α-proteins and 271 β-proteins were built for examining the database dependence of three all-atom statistical energy functions. They are RAPDF (residue-specific all-atom conditional probability discriminatory function), atomic KBP (atomic knowledge-based potential), and DFIRE (statistical potential based on distance-scaled finite ideal-gas reference state). These energy functions differ in the reference states used for energy derivation. The energy functions extracted from the different structural databases are used to select native structures from multiple decoys of 64 α-proteins and 28 β-proteins. The performance in native structure selections indicates that the DFIRE-based energy function is mostly independent of the structural database whereas RAPDF and KBP have a significant dependence. The construction of two additional structural databases of α/β and α + β-proteins further confirmed the weak dependence of DFIRE on the structural databases of various structural classes. The possible source for the difference between the three all-atom statistical energy functions is that the physical reference state of ideal gas used in the DFIRE-based energy function is least dependent on the structural database.

AB - An accurate statistical energy function that is suitable for the prediction of protein structures of all classes should be independent of the structural database used for energy extraction. Here, two high-resolution, low-sequence-identity structural databases of 333 α-proteins and 271 β-proteins were built for examining the database dependence of three all-atom statistical energy functions. They are RAPDF (residue-specific all-atom conditional probability discriminatory function), atomic KBP (atomic knowledge-based potential), and DFIRE (statistical potential based on distance-scaled finite ideal-gas reference state). These energy functions differ in the reference states used for energy derivation. The energy functions extracted from the different structural databases are used to select native structures from multiple decoys of 64 α-proteins and 28 β-proteins. The performance in native structure selections indicates that the DFIRE-based energy function is mostly independent of the structural database whereas RAPDF and KBP have a significant dependence. The construction of two additional structural databases of α/β and α + β-proteins further confirmed the weak dependence of DFIRE on the structural databases of various structural classes. The possible source for the difference between the three all-atom statistical energy functions is that the physical reference state of ideal gas used in the DFIRE-based energy function is least dependent on the structural database.

UR - http://www.scopus.com/inward/record.url?scp=2942694535&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=2942694535&partnerID=8YFLogxK

U2 - 10.1529/biophysj.103.035998

DO - 10.1529/biophysj.103.035998

M3 - Article

VL - 86

SP - 3349

EP - 3358

JO - Biophysical Journal

JF - Biophysical Journal

SN - 0006-3495

IS - 6

ER -