A Data-Driven Approach to Predicting Tablet Properties after Accelerated Test Using Raw Material Property Database and Machine Learning

Chem Pharm Bull (Tokyo). 2023;71(6):406-415. doi: 10.1248/cpb.c22-00538.

Abstract

The purpose of this study was to develop a model for predicting tablet properties after an accelerated test and to determine whether molecular descriptors affect tablet properties. Tablets were prepared using 81 types of active pharmaceutical ingredients, with the same formulation and three different levels of compression pressure. The tablet properties measured were the tensile strength and disintegration time of tablets after two weeks of accelerated test. The material properties measured were the change in tablet thickness before and after the accelerated test, maximum swelling force, swelling time, and swelling rate. The acquired data were added to our previously constructed database containing a total of 20 material properties and 3381 molecular descriptors. The feature importance values of molecular descriptors, material properties and the compression pressure for each tablet property were calculated by random forest, which is one type of machine learning (ML) that uses ensemble learning and decision trees. The results showed that more than half of the top 25 most important features were molecular descriptors for both tablet properties, indicating that molecular descriptors are strongly related to tablet properties. A prediction model of tablet properties was constructed by eight ML types using 25 of the most important features. The results showed that the boosted neural network exhibited the best prediction accuracy and was able to predict tablet properties with high accuracy. A data-driven approach is useful for discovering intricate relationships hidden within complex and large data sets and predicting tablet properties after an accelerated test.

Keywords: data-driven; machine learning; material library; molecular descriptor; quantitative structure–property relationship; tablet.

MeSH terms

  • Databases, Factual
  • Machine Learning*
  • Neural Networks, Computer*
  • Tablets
  • Tensile Strength

Substances

  • Tablets