Linear-Scaling Local Natural Orbital CCSD(T) Approach for Open-Shell Systems: Algorithms, Benchmarks, and Large-Scale Applications

J Chem Theory Comput. 2023 Nov 28;19(22):8166-8188. doi: 10.1021/acs.jctc.3c00881. Epub 2023 Nov 3.

Abstract

The extension of the highly optimized local natural orbital (LNO) coupled cluster (CC) with single-, double-, and perturbative triple excitations [LNO-CCSD(T)] method is presented for high-spin open-shell molecules based on restricted open-shell references. The techniques enabling the outstanding efficiency of the closed-shell LNO-CCSD(T) variant are adopted, including the iteration- and redundancy-free second-order Møller-Plesset and (T) formulations as well as the integral-direct, memory- and disk use-economic, and OpenMP-parallel algorithms. For large molecules, the efficiency of our open-shell LNO-CCSD(T) method approaches that of its closed-shell parent method due to the application of restricted orbital sets for demanding integral transformations and a novel approximation for higher-order long-range spin-polarization effects. The accuracy of open-shell LNO-CCSD(T) is extensively tested for radicals and reactions thereof, ionization processes, as well as spin-state splittings, and transition-metal compounds. At the size range where the canonical CCSD(T) reference is accessible (up to 20-30 atoms), the average open-shell LNO-CCSD(T) correlation energies are found to be 99.9 to 99.95% accurate, which translates into average absolute deviations of a few tenths of kcal/mol in the investigated energy differences already with the default settings. For more extensive molecules, the local errors may grow, but they can be estimated and decreased via affordable systematic convergence studies. This enables the accurate modeling of large systems with complex electronic structures, as illustrated on open-shell organic radicals and transition-metal complexes of up to 179 atoms as well as on challenging biochemical systems, including up to 601 atoms and 11,000 basis functions. While the protein models involve difficulties for local approximations, such as the spin states of a bounded iron ion or an extremely delocalized singly occupied orbital, the corresponding single-node LNO-CCSD(T) computations were feasible in a matter of days with 10s to 100 GB of memory use. Therefore, the new LNO-CCSD(T) implementation enables highly accurate computations for open-shell systems of unprecedented size and complexity with widely accessible hardware.