Plant microRNA-target interaction identification model based on the integration of prediction tools and support vector machine

PLoS One. 2014 Jul 22;9(7):e103181. doi: 10.1371/journal.pone.0103181. eCollection 2014.

Abstract

Background: Confident identification of microRNA-target interactions is significant for studying the function of microRNA (miRNA). Although some computational miRNA target prediction methods have been proposed for plants, results of various methods tend to be inconsistent and usually lead to more false positive. To address these issues, we developed an integrated model for identifying plant miRNA-target interactions.

Results: Three online miRNA target prediction toolkits and machine learning algorithms were integrated to identify and analyze Arabidopsis thaliana miRNA-target interactions. Principle component analysis (PCA) feature extraction and self-training technology were introduced to improve the performance. Results showed that the proposed model outperformed the previously existing methods. The results were validated by using degradome sequencing supported Arabidopsis thaliana miRNA-target interactions. The proposed model constructed on Arabidopsis thaliana was run over Oryza sativa and Vitis vinifera to demonstrate that our model is effective for other plant species.

Conclusions: The integrated model of online predictors and local PCA-SVM classifier gained credible and high quality miRNA-target interactions. The supervised learning algorithm of PCA-SVM classifier was employed in plant miRNA target identification for the first time. Its performance can be substantially improved if more experimentally proved training samples are provided.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Arabidopsis / genetics*
  • Gene Expression Regulation, Plant*
  • MicroRNAs / genetics*
  • Models, Genetic*
  • Oryza / genetics*
  • RNA, Plant / genetics
  • Support Vector Machine*
  • Vitis / genetics*

Substances

  • MicroRNAs
  • RNA, Plant

Grants and funding

This work and the article processing charge were supported by grants from the National Natural Science Foundation of China (No. 31272167), the Natural Science Foundation of Liaoning Province of China (No. 20130200029). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.