A Multitask Approach to Learn Molecular Properties

J Chem Inf Model. 2021 Aug 23;61(8):3824-3834. doi: 10.1021/acs.jcim.1c00646. Epub 2021 Jul 21.

Abstract

Efforts to build a robust multitask model that captures intertask correlations have continued for many years. The multitask deep neural network, the most widely used multitask framework, nevertheless suffers from several issues, such as inconsistent performance improvement over independent single-task baselines. This work introduces an alternative framework based on problem transformation methods. Our multitask models are built on the stacking of a base regressor and classifier, where multitarget predictions are obtained from an additional training stage on an expanded molecular feature space. The architecture is evaluated on the QM9, Alchemy, and Tox21 datasets using a variety of baseline machine learning techniques. The resulting multitask models improve prediction accuracy by 1-10%, with task-level performance consistently exceeding that of the independent single-target models. The proposed method is notably effective at capturing intertarget dependence and shows great potential for modeling a wide range of molecular properties under the transformation framework.
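The two-stage transformation described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the synthetic data, the choice of `RandomForestRegressor` as the base learner, and all variable names are assumptions. Stage 1 fits one independent single-target regressor per property; stage 2 appends the stage-1 predictions to the input descriptors and retrains, so each target's model can exploit intertarget correlations.

```python
# Hedged sketch of a stacked single-target transformation for multitarget
# regression. Data and base learner are illustrative assumptions only.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))                 # synthetic molecular descriptors
W = rng.normal(size=(10, 3))
Y = X @ W + 0.1 * rng.normal(size=(200, 3))    # three correlated target properties

# Stage 1: one independent single-target regressor per property.
stage1 = [RandomForestRegressor(n_estimators=50, random_state=0).fit(X, Y[:, t])
          for t in range(Y.shape[1])]
P = np.column_stack([m.predict(X) for m in stage1])

# Stage 2: expand the feature space with the stage-1 predictions and retrain,
# letting each target's model see the other targets' estimates.
X_exp = np.hstack([X, P])
stage2 = [RandomForestRegressor(n_estimators=50, random_state=0).fit(X_exp, Y[:, t])
          for t in range(Y.shape[1])]

def predict(X_new):
    """Chain both stages: stage-1 predictions feed the stage-2 models."""
    P_new = np.column_stack([m.predict(X_new) for m in stage1])
    X_new_exp = np.hstack([X_new, P_new])
    return np.column_stack([m.predict(X_new_exp) for m in stage2])

Y_hat = predict(X)
print(Y_hat.shape)  # one prediction per sample per target
```

In practice, the stage-1 predictions used for stage-2 training would typically come from out-of-fold cross-validation rather than in-sample fitting, to avoid leaking training targets into the expanded features.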

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Benchmarking
  • Machine Learning*
  • Neural Networks, Computer*
  • Research Design