Protein binding site prediction using an empirical scoring function

Shide Liang, Chi Zhang, Song Liu, Yaoqi Zhou

Research output: Contribution to journalArticlepeer-review

190 Scopus citations


Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.

Original languageEnglish (US)
Pages (from-to)3698-3707
Number of pages10
JournalNucleic acids research
Issue number13
StatePublished - 2006
Externally publishedYes

ASJC Scopus subject areas

  • Genetics


Dive into the research topics of 'Protein binding site prediction using an empirical scoring function'. Together they form a unique fingerprint.

Cite this