Development of a Prediction Model for Demolition Waste Generation Using a Random Forest Algorithm Based on Small DataSets

Int J Environ Res Public Health. 2020 Sep 24;17(19):6997. doi: 10.3390/ijerph17196997.

Abstract

Recently, artificial intelligence (AI) technologies have been employed to predict construction and demolition (C&D) waste generation. However, most studies have used machine learning models with continuous data input variables, applying algorithms, such as artificial neural networks, adaptive neuro-fuzzy inference systems, support vector machines, linear regression analysis, decision trees, and genetic algorithms. Therefore, machine learning algorithms may not perform as well when applied to categorical data. This article uses machine learning algorithms to predict C&D waste generation from a dataset, as a way to improve the accuracy of waste management in C&D facilities. These datasets include categorical (e.g., region, building structure, building use, wall material, and roofing material), and continuous data (particularly, gloss floor area), and a random forest (RF) algorithm was used. Results indicate that RF is an adequate machine learning algorithm for a small dataset consisting of categorical data, and even with a small dataset, an adequate prediction model can be developed. Despite the small dataset, the predictive performance according to the demolition waste (DW) type was R (Pearson's correlation coefficient) = 0.691-0.871, R2 (coefficient of determination) = 0.554-0.800, showing stable prediction performance. High prediction performance was observed using three (for mortar), five (for other DW types), or six (for concrete) input variables. This study is significant because the proposed RF model can predict DW generation using a small amount of data. Additionally, it demonstrates the possibility of applying AI to multi-purpose DW management.

Keywords: construction waste management; demolition waste management; leave-one-out cross-validation; prediction model; random forest; small data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Construction Industry
  • Machine Learning
  • Neural Networks, Computer*
  • Solid Waste*
  • Support Vector Machine

Substances

  • Solid Waste