Supervised Learning for Predictive Pore Size Classification of Regenerated Cellulose Membranes Based on Atomic Force Microscopy Measurements

Materials (Basel). 2021 Nov 8;14(21):6724. doi: 10.3390/ma14216724.

Abstract

Nanoporous dialysis membranes made of regenerated cellulose are used as molecular weight cutoff standards in bioseparations. In this study, mesoporous standards with Stokes' radii (50 kDa/2.7 nm, 100 kDa/3.4 nm and 1000 kDa/7.3 nm) and overlapping, skewed distributions were characterized using atomic force microscopy (AFM), with the specific aim of generating pore size classifiers for biomimetic membranes using supervised learning. A gamma transformation was applied before discriminant analysis, which was evaluated in terms of the area under the receiver operating characteristic curve (AUC) and classification accuracy (Acc). Monte Carlo simulations were run to generate datasets (n = 10) on which logistic regression was conducted using a constant ratio of 80:20 (measurement:algorithm training), followed by algorithm validation in WEKA. The proposed algorithm correctly classifies the 1000 kDa vs. 100 kDa membranes (AUC > 0.8), but discrimination is weak for 100 kDa vs. 50 kDa (AUC < 0.7); the latter is attributed to instrument accuracy errors below 5 nm. The cross-validation study indicates that a test size equivalent to 70% of the training set for the tapping mode (AUCtapping = 0.8341 ± 0.0519, Acctapping = 76.8% ± 5.9%) and 80% for the fluid mode (AUCfluid = 0.7614 ± 0.0314, Accfluid = 76.2% ± 1.0%) is needed for correct classification, which is predicted to reduce scan times.
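
To illustrate why overlapping skewed distributions limit discrimination, the following minimal Python sketch draws Monte Carlo samples for two membrane classes and computes the AUC as the probability that a pore from the larger class exceeds one from the smaller class. It is not the authors' pipeline (which used logistic regression and WEKA). The gamma-distributed pore sizes, the shape parameter, the sample size and the function names are all assumptions made for illustration; only the mean radii (2.7, 3.4 and 7.3 nm) come from the abstract.

```python
import random


def auc_mann_whitney(neg, pos):
    """AUC as P(pos > neg) + 0.5 * P(pos == neg), counted over all pairs."""
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def simulate_auc(mean_small, mean_large, shape=4.0, n=200, seed=0):
    """Monte Carlo AUC for two skewed pore size classes.

    Assumption: pore sizes follow gamma distributions with a common,
    hypothetical shape parameter; a gamma with shape k and scale theta
    has mean k * theta, so scale = mean / shape.
    """
    rng = random.Random(seed)
    small = [rng.gammavariate(shape, mean_small / shape) for _ in range(n)]
    large = [rng.gammavariate(shape, mean_large / shape) for _ in range(n)]
    return auc_mann_whitney(small, large)


# Mean radii in nm taken from the abstract; everything else is illustrative.
auc_far = simulate_auc(3.4, 7.3)    # 100 kDa vs. 1000 kDa
auc_close = simulate_auc(2.7, 3.4)  # 50 kDa vs. 100 kDa
print(f"AUC 1000 kDa vs. 100 kDa: {auc_far:.3f}")
print(f"AUC 100 kDa vs. 50 kDa:  {auc_close:.3f}")
```

Under these assumed distributions, the pair with well-separated means should yield a clearly higher AUC than the pair whose means differ by less than 1 nm, which mirrors the qualitative pattern reported above. The simulated values are not measurements and should not be compared numerically with the reported results.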

Keywords: atomic force microscopy; regenerated cellulose; supervised learning.