Comparative study of topological indices of macro/supramolecular RNA complex networks

J Chem Inf Model. 2008 Nov;48(11):2265-77. doi: 10.1021/ci8001809.

Abstract

RNA function annotation is often based on alignment to a previously studied template. In contrast to the study of proteins, there are not many alignment-free methods to predict RNA functions if alignment fails. The use of topological indices (TIs) of RNA complex networks (CNs) to find quantitative structure-activity relationships (QSAR) may be an alternative to incorporate secondary structure or sequence-to-sequence similarity. Here, we introduce new QSAR-like techniques using RNA macromolecular CNs (mmCNs), where nodes are nucleotides, or RNA supramolecular CNs (smCNs), where nodes are RNA sequences. We studied a data set of 198 sequences including 18S-rRNAs (important phylogenetic molecular biomarkers). We constructed three types of RNA mmCNs: sequence-linear (SL), Cartesian-lattice (CL), and sequence-folding CNs (SF-CNs) and two smCNs: sequence-sequence disagreement CN (SSD) and sequence-sequence similarity (SSS-smCN). We reported the first comparative QSAR study with all these CIs and CNs, which includes: (i) spectral moments ( ( i )micro d ( w)) of SL-mmCNs (accuracy = 75.3%), (ii) electrostatic CIs (xi d ) of CL-mmCNs (>90%), (iii) thermodynamic parameters (Delta G, Delta H, Delta S, and T m) of SF-mmCNs (64.7%), (iv) disagreement-distribution moments ( M k ) of the SSD-smCN (79.3%), and (v) node centralities of the SSD-smCN (78.0%). Furthermore, we reported the experimental isolation of a new RNA sequence from Psidum guajava leaf tissue and its QSAR and BLAST prediction to illustrate the practical use of these methods. We also investigated the use of these CNs to explore rRNA diversity on bacteria, plants, and parasites from the Dactylogyrus genus. The HPL-mmCNs model was the best of all found. All the CNs and TIs, except SF-mmCNs, were introduced here by the first time for the QSAR study of RNA, which allowed a comparative study for RNA classification.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computer Simulation
  • Databases, Nucleic Acid
  • Genetic Variation
  • Informatics
  • Macromolecular Substances / chemistry
  • Models, Molecular
  • Nucleic Acid Conformation
  • Platyhelminths / chemistry
  • Platyhelminths / genetics
  • Psidium / genetics
  • Quantitative Structure-Activity Relationship
  • RNA / chemistry*
  • RNA / classification
  • RNA / genetics
  • RNA, Bacterial / chemistry
  • RNA, Bacterial / genetics
  • RNA, Helminth / chemistry
  • RNA, Helminth / genetics
  • RNA, Plant / chemistry
  • RNA, Plant / genetics
  • RNA, Ribosomal, 18S / chemistry
  • RNA, Ribosomal, 18S / genetics
  • Static Electricity
  • Thermodynamics

Substances

  • Macromolecular Substances
  • RNA, Bacterial
  • RNA, Helminth
  • RNA, Plant
  • RNA, Ribosomal, 18S
  • RNA