Machine learning and statistical models for predicting indoor air quality

Indoor Air. 2019 Sep;29(5):704-726. doi: 10.1111/ina.12580. Epub 2019 Jul 25.

Abstract

Indoor air quality (IAQ), as determined by the concentrations of indoor air pollutants, can be predicted using either physically based mechanistic models or statistical models that are driven by measured data. In comparison with mechanistic models mostly used in unoccupied or scenario-based environments, statistical models have great potential to explore IAQ captured in large measurement campaigns or in real occupied environments. The present study carried out the first literature review of the use of statistical models to predict IAQ. The most commonly used statistical modeling methods were reviewed and their strengths and weaknesses discussed. Thirty-seven publications, in which statistical models were applied to predict IAQ, were identified. These studies were all published in the past decade, indicating the emergence of the awareness and application of machine learning and statistical modeling in the field of IAQ. The concentrations of indoor particulate matter (PM2.5 and PM10 ) were the most frequently studied parameters, followed by carbon dioxide and radon. The most popular statistical models applied to IAQ were artificial neural networks, multiple linear regression, partial least squares, and decision trees.

Keywords: IAQ; artificial neural networks; data mining; partial least squares; particulate matter; regression.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Air Pollutants / analysis
  • Air Pollution, Indoor / analysis*
  • Decision Trees*
  • Environmental Monitoring / methods
  • Humans
  • Machine Learning*
  • Models, Statistical
  • Neural Networks, Computer*
  • Regression Analysis*

Substances

  • Air Pollutants