Deep neural network model for highly accurate prediction of BODIPYs absorption

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Feb 15;267(Pt 2):120577. doi: 10.1016/j.saa.2021.120577. Epub 2021 Nov 3.

Abstract

A possibility to accurately predict the absorption maximum wavelength of BODIPYs was investigated. We found that previously reported models had a low accuracy (40-57 nm) to predict BODIPYs due to the limited dataset sizes and/or number of BODIPYs (few hundreds). New models developed in this study were based on data of 6000-plus fluorescent dyes (including 4000-plus BODIPYs) and the deep neural network architecture. The high prediction accuracy (five-fold cross-validation room mean squared error (RMSE) of 18.4 nm) was obtained using a consensus model, which was more accurate than individual models. This model provided the excellent accuracy (RMSE of 8 nm) for molecules previously synthesized in our laboratory as well as for prospective validation of three new BODIPYs. We found that solvent properties did not significantly influence the model accuracy since only few BODIPYs exhibited solvatochromism. The analysis of large prediction errors suggested that compounds able to have intermolecular interactions with solvent or salts were likely to be incorrectly predicted. The consensus model is freely available at https://ochem.eu/article/134921 and can help the other researchers to accelerate design of new dyes with desired properties.

Keywords: Absorption maximum wavelength; BODIPY; Deep neural networks; OCHEM; QSPR.

MeSH terms

  • Boron Compounds*
  • Crystallography, X-Ray
  • Fluorescent Dyes*
  • Neural Networks, Computer

Substances

  • 4,4-difluoro-4-bora-3a,4a-diaza-s-indacene
  • Boron Compounds
  • Fluorescent Dyes