Abstract
The quality of protein function predictions relies on appropriate training of protein classification methods. Performance of these methods can be affected when only a limited number of protein samples are available, which is often the case in divergent protein families. Whereas profile hidden Markov models and PSI-BLAST presented significant performance decrease in such cases, alignment-free partial least-squares classifiers performed consistently better even when used to identify short fragmented sequences.
Original language | English (US) |
---|---|
Pages (from-to) | 846-853 |
Number of pages | 8 |
Journal | Journal of proteome research |
Volume | 6 |
Issue number | 2 |
DOIs | |
State | Published - Feb 2007 |
Keywords
- Amino acid composition
- G-protein coupled receptors
- Partial least square
- Physico-chemical properties
- Profile hidden Markov model
ASJC Scopus subject areas
- General Chemistry
- Biochemistry