Designing Sustainable Hydrophilic Interfaces via Feature Selection from Molecular Descriptors and Time-Domain Nuclear Magnetic Resonance Relaxation Curves

Polymers (Basel). 2024 Mar 15;16(6):824. doi: 10.3390/polym16060824.

Abstract

Surface modification using hydrophilic polymer coatings is a sustainable approach for preventing membrane clogging due to foulant adhesion to water treatment membranes and reducing membrane-replacement frequency. Typically, both molecular descriptors and time-domain nuclear magnetic resonance (TD-NMR) data, which reveal physicochemical properties and polymer-chain dynamics, respectively, are required to predict the properties and understand the mechanisms of hydrophilic polymer coatings. However, studies on the selection of essential components from high-dimensional data and their application to the prediction of surface properties are scarce. Therefore, we developed a method for selecting features from combined high-dimensional molecular descriptors and TD-NMR data. The molecular descriptors of the monomers present in polyethylene terephthalate films were calculated using RDKit, an open-source chemoinformatics toolkit, and TD-NMR spectroscopy was performed over a wide time range using five-pulse sequences to investigate the mobility of the polymer chains. The model that analyzed the data using the random forest algorithm, after reducing the features using gradient boosting machine-based recursive feature elimination, achieved the highest prediction accuracy. The proposed method enables the extraction of important elements from both descriptors of surface properties and can contribute to the development of new sustainable materials and material-specific informatics methodologies encompassing multiple information modalities.

Keywords: chemoinformatics descriptors; contact angle; hydrophilic coating materials; machine learning; time-domain nuclear magnetic resonance.

Grants and funding

This research received no external funding.