Linear and non-linear chemometric modeling of THM formation in Barcelona's water treatment plant

Sci Total Environ. 2012 Aug 15:432:365-74. doi: 10.1016/j.scitotenv.2012.05.097. Epub 2012 Jul 1.

Abstract

The complex behavior observed for the dependence of trihalomethane formation on forty one water treatment plant (WTP) operational variables is investigated by means of linear and non-linear regression methods, including kernel-partial least squares (K-PLS), and support vector machine regression (SVR). Lower prediction errors of total trihalomethane concentrations (lower than 14% for external validation samples) were obtained when these two methods were applied in comparison to when linear regression methods were applied. A new visualization technique revealed the complex nonlinear relationships among the operational variables and displayed the existing correlations between input variables and the kernel matrix on one side and the support vectors on the other side. Whereas some water treatment plant variables like river water TOC and chloride concentrations, and breakpoint chlorination were not considered to be significant due to the multi-collinear effect in straight linear regression modeling methods, they were now confirmed to be significant using K-PLS and SVR non-linear modeling regression methods, proving the better performance of these methods for the prediction of complex formation of trihalomethanes in water disinfection plants.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chlorine / chemistry
  • Disinfection
  • Environmental Monitoring / methods*
  • Least-Squares Analysis
  • Linear Models
  • Models, Chemical
  • Nephelometry and Turbidimetry
  • Spain
  • Trihalomethanes / analysis
  • Trihalomethanes / chemistry*
  • Water Pollutants, Chemical / analysis
  • Water Pollutants, Chemical / chemistry*
  • Water Purification*
  • Water Supply

Substances

  • Trihalomethanes
  • Water Pollutants, Chemical
  • Chlorine