Frequency and hydrogen bonding of nucleobase homopairs in small molecule crystals

Nucleic Acids Res. 2020 Sep 4;48(15):8302-8319. doi: 10.1093/nar/gkaa629.

Abstract

We used the high resolution and accuracy of the Cambridge Structural Database (CSD) to provide detailed information regarding base pairing interactions of selected nucleobases. We searched for base pairs in which nucleobases interact with each other through two or more hydrogen bonds and form more or less planar structures. The investigated compounds were either free forms or derivatives of adenine, guanine, hypoxanthine, thymine, uracil and cytosine. We divided our findings into categories including types of pairs, protonation patterns and whether they are formed by free bases or substituted ones. We found base pair types that are exclusive to small molecule crystal structures, some that can be found only in RNA containing crystal structures and many that are native to both environments. With a few exceptions, nucleobase protonation generally followed a standard pattern governed by pKa values. The lengths of hydrogen bonds did not depend on whether the nucleobases forming a base pair were charged or not. The reasons why particular nucleobases formed base pairs in a certain way varied significantly.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenine / chemistry
  • Base Pairing / genetics
  • Crystallography, X-Ray
  • Cytosine / chemistry
  • Databases, Protein*
  • Guanine / chemistry
  • Hydrogen Bonding*
  • Hypoxanthine / chemistry
  • Molecular Structure
  • Protein Conformation*
  • Proteins / chemistry
  • Proteins / genetics*
  • Proteins / ultrastructure
  • Small Molecule Libraries / chemistry
  • Thymine / chemistry
  • Uracil / chemistry

Substances

  • Proteins
  • Small Molecule Libraries
  • Hypoxanthine
  • Uracil
  • Guanine
  • Cytosine
  • Adenine
  • Thymine