Generation of an accurate CCSD(T)/CBS data set and assessment of DFT methods for the binding strengths of group I metal-nucleic acid complexes

Front Chem. 2023 Nov 20:11:1296787. doi: 10.3389/fchem.2023.1296787. eCollection 2023.

Abstract

Accurate information about interactions between group I metals and nucleic acids is required to understand the roles these metals play in basic cellular functions, disease progression, and pharmaceuticals, as well as to aid the design of new energy storage materials and nucleic acid sensors that target metal contaminants, among other applications. From this perspective, this work generates a complete CCSD(T)/CBS data set of the binding energies for 64 complexes involving each group I metal (Li+, Na+, K+, Rb+, or Cs+) directly coordinated to various sites in each nucleic acid component (A, C, G, T, U, or dimethylphosphate). This data have otherwise been challenging to determine experimentally, with highly accurate information missing for many group I metal-nucleic acid combinations and no data available for the (charged) phosphate moiety. Subsequently, the performance of 61 DFT methods in combination with def2-TZVPP is tested against the newly generated CCSD(T)/CBS reference values. Detailed analysis of the results reveals that functional performance is dependent on the identity of the metal (with increased errors as group I is descended) and nucleic acid binding site (with larger errors for select purine coordination sites). Over all complexes considered, the best methods include the mPW2-PLYP double-hybrid and ωB97M-V RSH functionals (≤1.6% MPE; <1.0 kcal/mol MUE). If more computationally efficient approaches are required, the TPSS and revTPSS local meta-GGA functionals are reasonable alternatives (≤2.0% MPE; <1.0 kcal/mol MUE). Inclusion of counterpoise corrections to account for basis set superposition error only marginally improves the computed binding energies, suggesting that these corrections can be neglected with little loss in accuracy when using larger models that are necessary for describing biosystems and biomaterials. Overall, the most accurate functionals identified in this study will permit future works geared towards uncovering the impact of group I metals on the environment and human biology, designing new ways to selectively sense harmful metals, engineering modern biomaterials, and developing improved computational methods to more broadly study group I metal-nucleic acid interactions.

Keywords: DNA; RNA; alkali metals; biomolecules; chemical structure; computational chemistry; interaction energies.

Grants and funding

The authors declare financial support was received for the research, authorship, and/or publication of this article. We thank the Natural Sciences and Engineering Research Council of Canada (NSERC) [2016-04568] for financial support and the Digital Research Alliance of Canada for providing computational resources. SW also thanks the Canada Research Chairs program [2021-00484], while BB thanks NSERC (PGS-D) and the University of Lethbridge for student scholarships.