Mining the Arabidopsis and rice genomes for cyclophilin protein families

S. O. Opiyo, E. N. Moriyama

Research output: Contribution to journal › Article

8 Scopus citations

Abstract

Cyclophilins, which possess peptidyl-prolyl isomerase activity, are cellular targets of immunosuppressant drugs and are involved in a wide variety of functions. While the Arabidopsis thaliana genome contains the largest number of cyclophilins, the number of plant cyclophilins available in databases is small compared to that of other organisms. This implies that many cyclophilins are yet to be identified in plants. To identify cyclophilin candidates from available plant sequence data, we examined alignment-free methods based on Partial Least Squares (PLS). The PLS classifiers performed better than profile hidden Markov models and PSI-BLAST in identifying cyclophilins from the Arabidopsis and rice genomes.
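The general idea of an alignment-free PLS classifier can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which sequence descriptors or how many latent components were used, so the sketch assumes plain amino acid composition as the descriptor and a single PLS component. The function names (`composition`, `fit_pls1`, `score`) and the toy sequences are hypothetical and chosen only for illustration.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """Alignment-free descriptor (assumed here): fraction of each standard amino acid."""
    seq = seq.upper()
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]


def fit_pls1(X, y):
    """Fit a one-component PLS1 model; returns (x_mean, y_mean, w, q)."""
    n, m = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(m)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(m)] for row in X]
    yc = [v - y_mean for v in y]
    # Weight vector: direction in descriptor space with maximal covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(m)]
    norm = sum(v * v for v in w) ** 0.5
    w = [v / norm for v in w]
    # Latent scores, then the regression coefficient of y on those scores.
    t = [sum(Xc[i][j] * w[j] for j in range(m)) for i in range(n)]
    q = sum(t[i] * yc[i] for i in range(n)) / sum(v * v for v in t)
    return x_mean, y_mean, w, q


def score(model, seq):
    """Predicted response for a sequence; positive means 'family member' here."""
    x_mean, y_mean, w, q = model
    x = composition(seq)
    t = sum((x[j] - x_mean[j]) * w[j] for j in range(len(w)))
    return y_mean + q * t


# Synthetic toy strings, not real protein sequences.
positives = ["GGGGAGGG", "GGAGGGGG"]  # stand-ins for family members, labelled +1
negatives = ["LLLLKLLL", "LLKLLLLL"]  # stand-ins for non-members, labelled -1
model = fit_pls1(
    [composition(s) for s in positives + negatives],
    [1.0, 1.0, -1.0, -1.0],
)
print(score(model, "GGGGGGGG") > 0)
```

Because the descriptor is computed from each sequence on its own, no multiple alignment is required, which is the property that distinguishes this style of classifier from profile hidden Markov models and PSI-BLAST.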

Original language: English (US)
Pages (from-to): 295-309
Number of pages: 15
Journal: International Journal of Bioinformatics Research and Applications
Volume: 5
Issue number: 3
Publication status: Published - Jun 1 2009

Keywords

  • Bioinformatics
  • Cyclophilins
  • PLS
  • Partial least squares
  • Profile hidden Markov model

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Clinical Biochemistry
  • Health Information Management
