A systematic approach to RNA-associated motif discovery

Tian Gao, Jiang Shu, Juan Cui

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Background: Sequencing-based large screening of RNA-protein and RNA-RNA interactions has enabled the mechanistic study of post-transcriptional RNA processing and sorting, including exosome-mediated RNA secretion. The downstream analysis of RNA binding sites has encouraged the investigation of novel sequence motifs, which resulted in exceptional new challenges for identifying motifs from very short sequences (e.g., small non-coding RNAs or truncated messenger RNAs), where conventional methods tend to be ineffective. To address these challenges, we propose a novel motif-finding method and validate it on a wide range of RNA applications. Results: We first perform motif analysis on microRNAs and longer RNA fragments from various cellular and exosomal sources, and then validate our prediction through literature search and experimental test. For example, a 4 bp-long motif, GUUG, was detected to be responsible for microRNA loading in exosomes involved in human colon cancer (SW620). Additional performance comparisons in various case studies have shown that this new approach outperforms several existing state-of-the-art methods in detecting motifs with exceptional high coverage and explicitness. Conclusions: In this work, we have demonstrated the promising performance of a new motif discovery approach that is particularly effective in current RNA applications. Important discoveries resulting from this work include the identification of possible RNA-loading motifs in a variety of exosomes, as well as novel insights in sequence features of RNA cargos, i.e., short non-coding RNAs and messenger RNAs may share similar loading mechanism into exosomes. This method has been implemented and deployed as a new webserver named MDS2 which is accessible at http://sbbi-panda.unl.edu/MDS2/ , along with a standalone package available for download at https://github.com/sbbi/MDS2.

Original languageEnglish (US)
Article number146
JournalBMC genomics
Volume19
Issue number1
DOIs
StatePublished - Feb 14 2018

Fingerprint

Nucleotide Motifs
RNA
Exosomes
MicroRNAs
Post Transcriptional RNA Processing
Exosome Multienzyme Ribonuclease Complex
RNA Transport
Small Untranslated RNA
Untranslated RNA
Messenger RNA
Colonic Neoplasms
Binding Sites

Keywords

  • Exosomal RNAs
  • Exosomes
  • Graph algorithms
  • MicroRNAs
  • Motif finding
  • Short sequences

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

A systematic approach to RNA-associated motif discovery. / Gao, Tian; Shu, Jiang; Cui, Juan.

In: BMC genomics, Vol. 19, No. 1, 146, 14.02.2018.

Research output: Contribution to journalArticle

Gao, Tian ; Shu, Jiang ; Cui, Juan. / A systematic approach to RNA-associated motif discovery. In: BMC genomics. 2018 ; Vol. 19, No. 1.
@article{1668125d60174b36a1828231d4075169,
title = "A systematic approach to RNA-associated motif discovery",
abstract = "Background: Sequencing-based large screening of RNA-protein and RNA-RNA interactions has enabled the mechanistic study of post-transcriptional RNA processing and sorting, including exosome-mediated RNA secretion. The downstream analysis of RNA binding sites has encouraged the investigation of novel sequence motifs, which resulted in exceptional new challenges for identifying motifs from very short sequences (e.g., small non-coding RNAs or truncated messenger RNAs), where conventional methods tend to be ineffective. To address these challenges, we propose a novel motif-finding method and validate it on a wide range of RNA applications. Results: We first perform motif analysis on microRNAs and longer RNA fragments from various cellular and exosomal sources, and then validate our prediction through literature search and experimental test. For example, a 4 bp-long motif, GUUG, was detected to be responsible for microRNA loading in exosomes involved in human colon cancer (SW620). Additional performance comparisons in various case studies have shown that this new approach outperforms several existing state-of-the-art methods in detecting motifs with exceptional high coverage and explicitness. Conclusions: In this work, we have demonstrated the promising performance of a new motif discovery approach that is particularly effective in current RNA applications. Important discoveries resulting from this work include the identification of possible RNA-loading motifs in a variety of exosomes, as well as novel insights in sequence features of RNA cargos, i.e., short non-coding RNAs and messenger RNAs may share similar loading mechanism into exosomes. This method has been implemented and deployed as a new webserver named MDS2 which is accessible at http://sbbi-panda.unl.edu/MDS2/ , along with a standalone package available for download at https://github.com/sbbi/MDS2.",
keywords = "Exosomal RNAs, Exosomes, Graph algorithms, MicroRNAs, Motif finding, Short sequences",
author = "Tian Gao and Jiang Shu and Juan Cui",
year = "2018",
month = "2",
day = "14",
doi = "10.1186/s12864-018-4528-x",
language = "English (US)",
volume = "19",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - A systematic approach to RNA-associated motif discovery

AU - Gao, Tian

AU - Shu, Jiang

AU - Cui, Juan

PY - 2018/2/14

Y1 - 2018/2/14

N2 - Background: Sequencing-based large screening of RNA-protein and RNA-RNA interactions has enabled the mechanistic study of post-transcriptional RNA processing and sorting, including exosome-mediated RNA secretion. The downstream analysis of RNA binding sites has encouraged the investigation of novel sequence motifs, which resulted in exceptional new challenges for identifying motifs from very short sequences (e.g., small non-coding RNAs or truncated messenger RNAs), where conventional methods tend to be ineffective. To address these challenges, we propose a novel motif-finding method and validate it on a wide range of RNA applications. Results: We first perform motif analysis on microRNAs and longer RNA fragments from various cellular and exosomal sources, and then validate our prediction through literature search and experimental test. For example, a 4 bp-long motif, GUUG, was detected to be responsible for microRNA loading in exosomes involved in human colon cancer (SW620). Additional performance comparisons in various case studies have shown that this new approach outperforms several existing state-of-the-art methods in detecting motifs with exceptional high coverage and explicitness. Conclusions: In this work, we have demonstrated the promising performance of a new motif discovery approach that is particularly effective in current RNA applications. Important discoveries resulting from this work include the identification of possible RNA-loading motifs in a variety of exosomes, as well as novel insights in sequence features of RNA cargos, i.e., short non-coding RNAs and messenger RNAs may share similar loading mechanism into exosomes. This method has been implemented and deployed as a new webserver named MDS2 which is accessible at http://sbbi-panda.unl.edu/MDS2/ , along with a standalone package available for download at https://github.com/sbbi/MDS2.

AB - Background: Sequencing-based large screening of RNA-protein and RNA-RNA interactions has enabled the mechanistic study of post-transcriptional RNA processing and sorting, including exosome-mediated RNA secretion. The downstream analysis of RNA binding sites has encouraged the investigation of novel sequence motifs, which resulted in exceptional new challenges for identifying motifs from very short sequences (e.g., small non-coding RNAs or truncated messenger RNAs), where conventional methods tend to be ineffective. To address these challenges, we propose a novel motif-finding method and validate it on a wide range of RNA applications. Results: We first perform motif analysis on microRNAs and longer RNA fragments from various cellular and exosomal sources, and then validate our prediction through literature search and experimental test. For example, a 4 bp-long motif, GUUG, was detected to be responsible for microRNA loading in exosomes involved in human colon cancer (SW620). Additional performance comparisons in various case studies have shown that this new approach outperforms several existing state-of-the-art methods in detecting motifs with exceptional high coverage and explicitness. Conclusions: In this work, we have demonstrated the promising performance of a new motif discovery approach that is particularly effective in current RNA applications. Important discoveries resulting from this work include the identification of possible RNA-loading motifs in a variety of exosomes, as well as novel insights in sequence features of RNA cargos, i.e., short non-coding RNAs and messenger RNAs may share similar loading mechanism into exosomes. This method has been implemented and deployed as a new webserver named MDS2 which is accessible at http://sbbi-panda.unl.edu/MDS2/ , along with a standalone package available for download at https://github.com/sbbi/MDS2.

KW - Exosomal RNAs

KW - Exosomes

KW - Graph algorithms

KW - MicroRNAs

KW - Motif finding

KW - Short sequences

UR - http://www.scopus.com/inward/record.url?scp=85042111235&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85042111235&partnerID=8YFLogxK

U2 - 10.1186/s12864-018-4528-x

DO - 10.1186/s12864-018-4528-x

M3 - Article

VL - 19

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

IS - 1

M1 - 146

ER -