Hybrid ensemble-based machine learning model for predicting phosphorus concentrations in hydroponic solution

Spectrochim Acta A Mol Biomol Spectrosc. 2024 Jan 5:304:123327. doi: 10.1016/j.saa.2023.123327. Epub 2023 Sep 1.

Abstract

Accurate, label-free, and rapid methods for measuring phosphorus concentrations are essential in hydroponic systems, as excessive or insufficient phosphorus levels can adversely affect plant growth, human health, and environmental sustainability. In this study, we demonstrate the advantages of hybrid machine learning models over single machine learning models in predicting phosphorus concentration from an absorbance dataset. Three machine learning classifiers, Random Forest (RF), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN), were employed as bases for the single and hybrid models. Three ensemble techniques (voting, bagging, and stacking) were used to hybridize the classifiers. Among the single models, KNN had the shortest computational time (18.07 s), while SVM achieved the highest accuracy (99.6%). The hybrid SVM/KNN model using a voting classifier showed a significant increase in accuracy over KNN alone, with only a slight increase in computational time. Bagging increased accuracy, but at the cost of longer computational time. The stacking technique, which combined SVM, KNN, and RF, achieved the highest accuracy of 99.73% with a computational time of 36.18 s, which is short compared to the bagging and voting techniques. This study demonstrates that machine learning methods can effectively distinguish phosphorus concentrations. Moreover, hybrid machine learning techniques can improve accuracy for predicting phosphorus without using labels, despite requiring longer computational time.
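The voting technique mentioned in the abstract can be illustrated with a minimal sketch of hard (majority) voting over the outputs of several base classifiers. The abstract does not state whether hard or soft voting was used or which software was employed, so this is an assumption made for illustration only. The function name `hard_vote`, the class labels ("low", "medium", "high"), and the per-model prediction lists are all hypothetical and are not data from the study.

```python
from collections import Counter


def hard_vote(predictions_per_model):
    """Combine per-model class predictions by majority vote, sample by sample.

    predictions_per_model: list of equal-length lists, one list per base model.
    Ties are broken in favour of the model listed first (an arbitrary choice
    made for this sketch).
    """
    n_samples = len(predictions_per_model[0])
    if any(len(p) != n_samples for p in predictions_per_model):
        raise ValueError("all models must predict the same number of samples")

    combined = []
    for sample_votes in zip(*predictions_per_model):
        counts = Counter(sample_votes)
        top = max(counts.values())
        # Pick the first vote (in model order) that reached the top count.
        winner = next(v for v in sample_votes if counts[v] == top)
        combined.append(winner)
    return combined


# Hypothetical predictions for four absorbance samples (not study data).
svm_pred = ["low", "high", "medium", "high"]
knn_pred = ["low", "medium", "medium", "low"]
rf_pred = ["medium", "high", "medium", "low"]

print(hard_vote([svm_pred, knn_pred, rf_pred]))
```

With only two base models, as in the SVM/KNN hybrid, every disagreement is a tie, so the tie-breaking rule (or a switch to probability-weighted soft voting) determines the outcome. Stacking differs in that a further model is trained on the base models' predictions instead of applying a fixed voting rule.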

Keywords: Ensemble technique; Hydroponic; Machine learning; Nutrient; Spectroscopy.