MIMOSA: Algorithms for Microbial Profiling

O. Ufuk Nalbantoglu, Khalid Sayood

Research output: Contribution to journalArticle

Abstract

A significant goal of the study of metagenomes obtained from an environment is to find the microbial diversity and the abundance of each organism in the community. Phylotyping and binning methods which address this problem generally operate using either marker sequences or by classifying each genome fragment individually. However, these approaches might not use all the information contained in the metagenome. We propose an approach based on a Multiple Input Multiple Output (MIMO) communication system model. Results from two different implementations of this approach, one using DNA-DNA hybridization simulations and one using short read mapping are evaluated using simulated and actual metagenomes and compared with other methods of phylotyping. The proposed approaches generally performed better under different scenarios including pathogen detection tasks of community complexity and low and high sequencing coverage while being highly computationally effective. The resulting framework can be integrated to metagenome analysis pipelines for phylogenetic diversity estimation. The approach is modular so that techniques other than hybridization simulations and short read mapping may be integrated. We have observed that even for low coverage samples, the method provides accurate estimates. Therefore, the use of the proposed strategy could enable the task of exploring biodiversity with limited resources.

Original languageEnglish (US)
Article number8350295
Pages (from-to)2023-2034
Number of pages12
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume16
Issue number6
DOIs
StatePublished - Nov 1 2019

Fingerprint

Metagenome
Profiling
DNA
Biodiversity
Coverage
Pathogens
Binning
Communication systems
Multiple-input multiple-output (MIMO) Systems
Pipelines
Genes
Phylogenetics
Sequencing
Communication Systems
Fragment
Simulation
Genome
Scenarios
Resources
Estimate

Keywords

  • Metagenomics
  • microbial diversity estimation
  • phylotyping
  • sequence analysis
  • sparse recovery algorithms

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Cite this

MIMOSA : Algorithms for Microbial Profiling. / Nalbantoglu, O. Ufuk; Sayood, Khalid.

In: IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 16, No. 6, 8350295, 01.11.2019, p. 2023-2034.

Research output: Contribution to journalArticle

@article{56f584be17d64f25acfb397652ef9fcb,
title = "MIMOSA: Algorithms for Microbial Profiling",
abstract = "A significant goal of the study of metagenomes obtained from an environment is to find the microbial diversity and the abundance of each organism in the community. Phylotyping and binning methods which address this problem generally operate using either marker sequences or by classifying each genome fragment individually. However, these approaches might not use all the information contained in the metagenome. We propose an approach based on a Multiple Input Multiple Output (MIMO) communication system model. Results from two different implementations of this approach, one using DNA-DNA hybridization simulations and one using short read mapping are evaluated using simulated and actual metagenomes and compared with other methods of phylotyping. The proposed approaches generally performed better under different scenarios including pathogen detection tasks of community complexity and low and high sequencing coverage while being highly computationally effective. The resulting framework can be integrated to metagenome analysis pipelines for phylogenetic diversity estimation. The approach is modular so that techniques other than hybridization simulations and short read mapping may be integrated. We have observed that even for low coverage samples, the method provides accurate estimates. Therefore, the use of the proposed strategy could enable the task of exploring biodiversity with limited resources.",
keywords = "Metagenomics, microbial diversity estimation, phylotyping, sequence analysis, sparse recovery algorithms",
author = "Nalbantoglu, {O. Ufuk} and Khalid Sayood",
year = "2019",
month = "11",
day = "1",
doi = "10.1109/TCBB.2018.2830324",
language = "English (US)",
volume = "16",
pages = "2023--2034",
journal = "IEEE/ACM Transactions on Computational Biology and Bioinformatics",
issn = "1545-5963",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "6",

}

TY - JOUR

T1 - MIMOSA

T2 - Algorithms for Microbial Profiling

AU - Nalbantoglu, O. Ufuk

AU - Sayood, Khalid

PY - 2019/11/1

Y1 - 2019/11/1

N2 - A significant goal of the study of metagenomes obtained from an environment is to find the microbial diversity and the abundance of each organism in the community. Phylotyping and binning methods which address this problem generally operate using either marker sequences or by classifying each genome fragment individually. However, these approaches might not use all the information contained in the metagenome. We propose an approach based on a Multiple Input Multiple Output (MIMO) communication system model. Results from two different implementations of this approach, one using DNA-DNA hybridization simulations and one using short read mapping are evaluated using simulated and actual metagenomes and compared with other methods of phylotyping. The proposed approaches generally performed better under different scenarios including pathogen detection tasks of community complexity and low and high sequencing coverage while being highly computationally effective. The resulting framework can be integrated to metagenome analysis pipelines for phylogenetic diversity estimation. The approach is modular so that techniques other than hybridization simulations and short read mapping may be integrated. We have observed that even for low coverage samples, the method provides accurate estimates. Therefore, the use of the proposed strategy could enable the task of exploring biodiversity with limited resources.

AB - A significant goal of the study of metagenomes obtained from an environment is to find the microbial diversity and the abundance of each organism in the community. Phylotyping and binning methods which address this problem generally operate using either marker sequences or by classifying each genome fragment individually. However, these approaches might not use all the information contained in the metagenome. We propose an approach based on a Multiple Input Multiple Output (MIMO) communication system model. Results from two different implementations of this approach, one using DNA-DNA hybridization simulations and one using short read mapping are evaluated using simulated and actual metagenomes and compared with other methods of phylotyping. The proposed approaches generally performed better under different scenarios including pathogen detection tasks of community complexity and low and high sequencing coverage while being highly computationally effective. The resulting framework can be integrated to metagenome analysis pipelines for phylogenetic diversity estimation. The approach is modular so that techniques other than hybridization simulations and short read mapping may be integrated. We have observed that even for low coverage samples, the method provides accurate estimates. Therefore, the use of the proposed strategy could enable the task of exploring biodiversity with limited resources.

KW - Metagenomics

KW - microbial diversity estimation

KW - phylotyping

KW - sequence analysis

KW - sparse recovery algorithms

UR - http://www.scopus.com/inward/record.url?scp=85046338783&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85046338783&partnerID=8YFLogxK

U2 - 10.1109/TCBB.2018.2830324

DO - 10.1109/TCBB.2018.2830324

M3 - Article

C2 - 29994027

AN - SCOPUS:85046338783

VL - 16

SP - 2023

EP - 2034

JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics

JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics

SN - 1545-5963

IS - 6

M1 - 8350295

ER -