A deep learning model for predicting selected organic molecular spectra

Nat Comput Sci. 2023 Nov;3(11):957-964. doi: 10.1038/s43588-023-00550-y. Epub 2023 Nov 13.

Abstract

Accurate and efficient molecular spectra simulations are crucial for substance discovery and structure identification. However, the conventional approach of relying on the quantum chemistry is cost intensive, which hampers efficiency. Here we develop DetaNet, a deep-learning model combining E(3)-equivariance group and self-attention mechanism to predict molecular spectra with improved efficiency and accuracy. By passing high-order geometric tensorial messages, DetaNet is able to generate a wide variety of molecular properties, including scalars, vectors, and second- and third-order tensors-all at the accuracy of quantum chemistry calculations. Based on this we developed generalized modules to predict four important types of molecular spectra, namely infrared, Raman, ultraviolet-visible, and 1H and 13C nuclear magnetic resonance, taking the QM9S dataset containing 130,000 molecular species as an example. By speeding up the prediction of molecular spectra at quantum chemical accuracy, DetaNet could help progress toward real-time structural identification using spectroscopic measurements.

MeSH terms

  • Deep Learning*
  • Magnetic Resonance Spectroscopy
  • Models, Molecular
  • Quantum Theory
  • Spectrophotometry, Ultraviolet