Protein binding site prediction using an empirical scoring function

Shide Liang, Chi Zhang, Song Liu, Yaoqi Zhou

Research output: Contribution to journalArticle

173 Citations (Scopus)

Abstract

Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.

Original languageEnglish (US)
Pages (from-to)3698-3707
Number of pages10
JournalNucleic acids research
Volume34
Issue number13
DOIs
StatePublished - Sep 22 2006

Fingerprint

Protein Binding
Binding Sites
Proteins
Biological Phenomena
Nucleic Acids
Weights and Measures

ASJC Scopus subject areas

  • Genetics

Cite this

Protein binding site prediction using an empirical scoring function. / Liang, Shide; Zhang, Chi; Liu, Song; Zhou, Yaoqi.

In: Nucleic acids research, Vol. 34, No. 13, 22.09.2006, p. 3698-3707.

Research output: Contribution to journalArticle

Liang, Shide ; Zhang, Chi ; Liu, Song ; Zhou, Yaoqi. / Protein binding site prediction using an empirical scoring function. In: Nucleic acids research. 2006 ; Vol. 34, No. 13. pp. 3698-3707.
@article{197390ab69394f0091fa738666e4b319,
title = "Protein binding site prediction using an empirical scoring function",
abstract = "Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2{\%} coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5{\%} accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15{\%}. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4{\%}, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.",
author = "Shide Liang and Chi Zhang and Song Liu and Yaoqi Zhou",
year = "2006",
month = "9",
day = "22",
doi = "10.1093/nar/gkl454",
language = "English (US)",
volume = "34",
pages = "3698--3707",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "13",

}

TY - JOUR

T1 - Protein binding site prediction using an empirical scoring function

AU - Liang, Shide

AU - Zhang, Chi

AU - Liu, Song

AU - Zhou, Yaoqi

PY - 2006/9/22

Y1 - 2006/9/22

N2 - Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.

AB - Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.

UR - http://www.scopus.com/inward/record.url?scp=33747150197&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33747150197&partnerID=8YFLogxK

U2 - 10.1093/nar/gkl454

DO - 10.1093/nar/gkl454

M3 - Article

C2 - 16893954

AN - SCOPUS:33747150197

VL - 34

SP - 3698

EP - 3707

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - 13

ER -