Statistical analysis, machine learning modeling, and text analytics of aggregation attachment efficiency: Mono and binary particle systems

J Hazard Mater. 2023 Jul 15:454:131482. doi: 10.1016/j.jhazmat.2023.131482. Epub 2023 Apr 24.

Abstract

The aggregation attachment efficiency (α) is the fraction of particle-particle collisions resulting in aggregation. Despite significant research, α predictions have not accounted for the full complexity of systems due to constraints imposed by particle types, dispersed matter, water chemistry, quantification methods, and modeling. Experimental α values are often case-specific, and simplified systems are used to rule out complexity. To address these challenges, statistical analysis was performed on α databases to identify gaps in current knowledge, and machine learning (ML) was used to predict α under various particle types and conditions. Moreover, text analytics was employed to support knowledge from statistics and ML, as well as gain insight into the ideas communicated by current literature. Most studies investigated α in mono-particle systems, but binary or higher systems require more investigation. Furthermore, our work highlights that numerous variables, interactions, and mechanisms influence α behavior, making its investigation complex and difficult for both experiments and modeling. Consequently, future research should incorporate more particle types, shapes, coatings, and surface heterogeneities, and aim to address overlooked variables and conditions. Therefore, building a comprehensive α database can enable the development of more accurate empirical models for prediction.

Keywords: Machine learning; Missing data imputation; Text analytics; Topic modeling; Word correlation.