Mining Cytochrome b561 proteins from plant genomes

Stephen O. Opiyo, Etsuko N. Moriyama

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Cytochrome b561 (Cyt-b561) proteins are important for plant growth, development, and prevention of damage to plants. Because of their high sequence divergence, thorough mining of Cyt-b561 proteins from plant genomes are not easy. Currently there is only one Cyt-b561 gene found in the maize and none in the soybean genome. However, 22 have been identified in the Arabidopsis thaliana genome. We tested alignment-free protein classifiers based on partial least squares (PLS) and support vector machines to identify Cyt-b561. These classifiers performed better than profile hidden Markov models and PSI-BLAST. Using these classifiers we identified new Cyt-b561-related proteins from four plant genomes.

Original languageEnglish (US)
Pages (from-to)209-221
Number of pages13
JournalInternational Journal of Bioinformatics Research and Applications
Volume6
Issue number2
DOIs
StatePublished - Jan 1 2010

Fingerprint

Plant Genome
Plant Proteins
Genes
Proteins
Classifiers
Genome
Plant Development
Hidden Markov models
Least-Squares Analysis
Growth and Development
Soybeans
Arabidopsis
Zea mays
Support vector machines
cytochrome b561

Keywords

  • Cytochrome b561
  • PLS
  • PSI-BLAST
  • SVMs
  • partial least squares
  • profile hidden Markov model
  • support vector machines

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Clinical Biochemistry
  • Health Information Management

Cite this

Mining Cytochrome b561 proteins from plant genomes. / Opiyo, Stephen O.; Moriyama, Etsuko N.

In: International Journal of Bioinformatics Research and Applications, Vol. 6, No. 2, 01.01.2010, p. 209-221.

Research output: Contribution to journalArticle

@article{fe1627c048d940f080084a13271c07ae,
title = "Mining Cytochrome b561 proteins from plant genomes",
abstract = "Cytochrome b561 (Cyt-b561) proteins are important for plant growth, development, and prevention of damage to plants. Because of their high sequence divergence, thorough mining of Cyt-b561 proteins from plant genomes are not easy. Currently there is only one Cyt-b561 gene found in the maize and none in the soybean genome. However, 22 have been identified in the Arabidopsis thaliana genome. We tested alignment-free protein classifiers based on partial least squares (PLS) and support vector machines to identify Cyt-b561. These classifiers performed better than profile hidden Markov models and PSI-BLAST. Using these classifiers we identified new Cyt-b561-related proteins from four plant genomes.",
keywords = "Cytochrome b561, PLS, PSI-BLAST, SVMs, partial least squares, profile hidden Markov model, support vector machines",
author = "Opiyo, {Stephen O.} and Moriyama, {Etsuko N.}",
year = "2010",
month = "1",
day = "1",
doi = "10.1504/IJBRA.2010.032122",
language = "English (US)",
volume = "6",
pages = "209--221",
journal = "International Journal of Bioinformatics Research and Applications",
issn = "1744-5485",
publisher = "Inderscience Enterprises Ltd",
number = "2",

}

TY - JOUR

T1 - Mining Cytochrome b561 proteins from plant genomes

AU - Opiyo, Stephen O.

AU - Moriyama, Etsuko N.

PY - 2010/1/1

Y1 - 2010/1/1

N2 - Cytochrome b561 (Cyt-b561) proteins are important for plant growth, development, and prevention of damage to plants. Because of their high sequence divergence, thorough mining of Cyt-b561 proteins from plant genomes are not easy. Currently there is only one Cyt-b561 gene found in the maize and none in the soybean genome. However, 22 have been identified in the Arabidopsis thaliana genome. We tested alignment-free protein classifiers based on partial least squares (PLS) and support vector machines to identify Cyt-b561. These classifiers performed better than profile hidden Markov models and PSI-BLAST. Using these classifiers we identified new Cyt-b561-related proteins from four plant genomes.

AB - Cytochrome b561 (Cyt-b561) proteins are important for plant growth, development, and prevention of damage to plants. Because of their high sequence divergence, thorough mining of Cyt-b561 proteins from plant genomes are not easy. Currently there is only one Cyt-b561 gene found in the maize and none in the soybean genome. However, 22 have been identified in the Arabidopsis thaliana genome. We tested alignment-free protein classifiers based on partial least squares (PLS) and support vector machines to identify Cyt-b561. These classifiers performed better than profile hidden Markov models and PSI-BLAST. Using these classifiers we identified new Cyt-b561-related proteins from four plant genomes.

KW - Cytochrome b561

KW - PLS

KW - PSI-BLAST

KW - SVMs

KW - partial least squares

KW - profile hidden Markov model

KW - support vector machines

UR - http://www.scopus.com/inward/record.url?scp=77953630985&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953630985&partnerID=8YFLogxK

U2 - 10.1504/IJBRA.2010.032122

DO - 10.1504/IJBRA.2010.032122

M3 - Article

C2 - 20223741

AN - SCOPUS:77953630985

VL - 6

SP - 209

EP - 221

JO - International Journal of Bioinformatics Research and Applications

JF - International Journal of Bioinformatics Research and Applications

SN - 1744-5485

IS - 2

ER -