Prediction of protein phosphorylation sites by integrating secondary structure information and other one-dimensional structural properties

Yongchao Dou, Bo Yao, Chi Zhang

Research output: Chapter in Book/Report/Conference proceedingChapter

2 Scopus citations

Abstract

Studies on phosphorylation are important but challenging for both wet-bench experiments and computational studies, and accurate non-kinase-specific prediction tools are highly desirable for whole-genome annotation in a wide variety of species. Here, we describe a phosphorylation site prediction webserver, PhosphoSVM, that employs Support Vector Machine to combine protein secondary structure information and seven other one-dimensional structural properties, including Shannon entropy, relative entropy, predicted protein disorder information, predicted solvent accessible area, amino acid overlapping properties, averaged cumulative hydrophobicity, and subsequence k-nearest neighbor profiles. This method achieved AUC values of 0.8405/0.8183/0.7383 for serine (S), threonine (T), and tyrosine (Y) phosphorylation sites, respectively, in animals with a tenfold cross-validation. The model trained by the animal phosphorylation sites was also applied to a plant phosphorylation site dataset as an independent test. The AUC values for the independent test data set were 0.7761/0.6652/0.5958 for S/T/Y phosphorylation sites, respectively. This algorithm with the optimally trained model was implemented as a webserver. The webserver, trained model, and all datasets used in the current study are available at http://sysbio.unl.edu/PhosphoSVM.

Original languageEnglish (US)
Title of host publicationMethods in Molecular Biology
PublisherHumana Press Inc.
Pages265-274
Number of pages10
DOIs
StatePublished - 2017

Publication series

NameMethods in Molecular Biology
Volume1484
ISSN (Print)1064-3745

Keywords

  • Non-kinase-specific tool
  • Phosphorylation site prediction
  • Support vector machine

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics

Fingerprint

Dive into the research topics of 'Prediction of protein phosphorylation sites by integrating secondary structure information and other one-dimensional structural properties'. Together they form a unique fingerprint.

Cite this