Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity.

Lianyi Han, Juan Cui, Honghuang Lin, Zhiliang Ji, Zhiwei Cao, Yixue Li, Yuzong Chen

Research output: Contribution to journalReview article

58 Citations (Scopus)

Abstract

Protein sequence contains clues to its function. Functional prediction from sequence presents a challenge particularly for proteins that have low or no sequence similarity to proteins of known function. Recently, machine learning methods have been explored for predicting functional class of proteins from sequence-derived properties independent of sequence similarity, which showed promising potential for low- and non-homologous proteins. These methods can thus be explored as potential tools to complement alignment- and clustering-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using machine learning methods for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented, which need to be interpreted with caution as they are dependent on such factors as datasets used and choice of parameters.

Original languageEnglish (US)
Pages (from-to)4023-4037
Number of pages15
JournalProteomics
Volume6
Issue number14
DOIs
StatePublished - Jan 1 2006

Fingerprint

Learning systems
Proteins
Machine Learning
Cluster Analysis
Servers
Software

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology

Cite this

Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity. / Han, Lianyi; Cui, Juan; Lin, Honghuang; Ji, Zhiliang; Cao, Zhiwei; Li, Yixue; Chen, Yuzong.

In: Proteomics, Vol. 6, No. 14, 01.01.2006, p. 4023-4037.

Research output: Contribution to journalReview article

Han, Lianyi ; Cui, Juan ; Lin, Honghuang ; Ji, Zhiliang ; Cao, Zhiwei ; Li, Yixue ; Chen, Yuzong. / Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity. In: Proteomics. 2006 ; Vol. 6, No. 14. pp. 4023-4037.
@article{97fad38c211542d29d7f11f17dbd778e,
title = "Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity.",
abstract = "Protein sequence contains clues to its function. Functional prediction from sequence presents a challenge particularly for proteins that have low or no sequence similarity to proteins of known function. Recently, machine learning methods have been explored for predicting functional class of proteins from sequence-derived properties independent of sequence similarity, which showed promising potential for low- and non-homologous proteins. These methods can thus be explored as potential tools to complement alignment- and clustering-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using machine learning methods for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented, which need to be interpreted with caution as they are dependent on such factors as datasets used and choice of parameters.",
author = "Lianyi Han and Juan Cui and Honghuang Lin and Zhiliang Ji and Zhiwei Cao and Yixue Li and Yuzong Chen",
year = "2006",
month = "1",
day = "1",
doi = "10.1002/pmic.200500938",
language = "English (US)",
volume = "6",
pages = "4023--4037",
journal = "Proteomics",
issn = "1615-9853",
publisher = "Wiley-VCH Verlag",
number = "14",

}

TY - JOUR

T1 - Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity.

AU - Han, Lianyi

AU - Cui, Juan

AU - Lin, Honghuang

AU - Ji, Zhiliang

AU - Cao, Zhiwei

AU - Li, Yixue

AU - Chen, Yuzong

PY - 2006/1/1

Y1 - 2006/1/1

N2 - Protein sequence contains clues to its function. Functional prediction from sequence presents a challenge particularly for proteins that have low or no sequence similarity to proteins of known function. Recently, machine learning methods have been explored for predicting functional class of proteins from sequence-derived properties independent of sequence similarity, which showed promising potential for low- and non-homologous proteins. These methods can thus be explored as potential tools to complement alignment- and clustering-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using machine learning methods for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented, which need to be interpreted with caution as they are dependent on such factors as datasets used and choice of parameters.

AB - Protein sequence contains clues to its function. Functional prediction from sequence presents a challenge particularly for proteins that have low or no sequence similarity to proteins of known function. Recently, machine learning methods have been explored for predicting functional class of proteins from sequence-derived properties independent of sequence similarity, which showed promising potential for low- and non-homologous proteins. These methods can thus be explored as potential tools to complement alignment- and clustering-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using machine learning methods for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented, which need to be interpreted with caution as they are dependent on such factors as datasets used and choice of parameters.

UR - http://www.scopus.com/inward/record.url?scp=33750125547&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33750125547&partnerID=8YFLogxK

U2 - 10.1002/pmic.200500938

DO - 10.1002/pmic.200500938

M3 - Review article

VL - 6

SP - 4023

EP - 4037

JO - Proteomics

JF - Proteomics

SN - 1615-9853

IS - 14

ER -