Utilizing public and private sector data to build better machine learning models for the prediction of pharmacokinetic parameters

Drug Discov Today. 2022 Nov;27(11):103339. doi: 10.1016/j.drudis.2022.103339. Epub 2022 Aug 13.

Abstract

One solution to compensate for the shortage of publicly available data is to collect more quality-controlled data from the private sector through public-private partnerships. However, several issues must be resolved before implementing such a system. Here, we review the technical aspects of public-private partnerships using our initiative in Japan as an example. In particular, we focus on the procedure for collecting data from multiple private sector companies and building prediction models and discuss how merging public and private sector datasets will help to improve the chemical space coverage and prediction performance. Teaser: Japan's first public-private consortium in pharmacokinetics has incorporated data from multiple pharmaceutical companies to create useful predictive models.

Keywords: Data collection; Database; Descriptor; Machine learning; Pharmacokinetic prediction; Prediction model.