Analysis and Identification of Aptamer-Compound Interactions with a Maximum Relevance Minimum Redundancy and Nearest Neighbor Algorithm

Biomed Res Int. 2016:2016:8351204. doi: 10.1155/2016/8351204. Epub 2016 Feb 3.

Abstract

The development of biochemistry and molecular biology has revealed an increasingly important role of compounds in several biological processes. Like the aptamer-protein interaction, aptamer-compound interaction attracts increasing attention. However, it is time-consuming to select proper aptamers against compounds using traditional methods, such as exponential enrichment. Thus, there is an urgent need to design effective computational methods for searching effective aptamers against compounds. This study attempted to extract important features for aptamer-compound interactions using feature selection methods, such as Maximum Relevance Minimum Redundancy, as well as incremental feature selection. Each aptamer-compound pair was represented by properties derived from the aptamer and compound, including frequencies of single nucleotides and dinucleotides for the aptamer, as well as the constitutional, electrostatic, quantum-chemical, and space conformational descriptors of the compounds. As a result, some important features were obtained. To confirm the importance of the obtained features, we further discussed the associations between them and aptamer-compound interactions. Simultaneously, an optimal prediction model based on the nearest neighbor algorithm was built to identify aptamer-compound interactions, which has the potential to be a useful tool for the identification of novel aptamer-compound interactions. The program is available upon the request.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Aptamers, Nucleotide / chemistry*
  • Cluster Analysis
  • Computational Biology / methods*
  • Models, Theoretical*
  • Molecular Conformation
  • Proteins / chemistry*
  • Quantum Theory

Substances

  • Aptamers, Nucleotide
  • Proteins