Auto-classification of biomass through characterization of their pyrolysis behaviors using thermogravimetric analysis with support vector machine algorithm: case study for tobacco

Biotechnol Biofuels. 2021 Apr 27;14(1):106. doi: 10.1186/s13068-021-01942-w.

Abstract

Background: During the biomass-to-bio-oil conversion process, many studies focus on studying the association between biomass and bio-products using near-infrared spectra (NIR) and chemical analysis methods. However, the characterization of biomass pyrolysis behaviors using thermogravimetric analysis (TGA) with support vector machine (SVM) algorithm has not been reported. In this study, tobacco was chosen as the object for biomass, because the cigarette smoke (including water, tar, and gases) released by tobacco pyrolysis reactions decides the sensory quality, which is similar to biomass as a renewable resource through the pyrolysis process.

Results: SVM algorithm has been employed to automatically classify the planting area and growing position of tobacco leaves using thermogravimetric analysis data as the information source for the first time. Eighty-eight single-grade tobacco samples belonging to four grades and eight categories were split into the training, validation, and blind testing sets. Our model showed excellent performances in both the training and validation set as well as in the blind test, with accuracy over 91.67%. Throughout the whole dataset of 88 samples, our model not only provides precise results on the planting area of tobacco leave, but also accurately distinguishes the major grades among the upper, lower, and middle positions. The error only occurs in the classification of subgrades of the middle position.

Conclusions: From the case study of tobacco, our results validated the feasibility of using TGA with SVM algorithm as an objective and fast method for auto-classification of tobacco planting area and growing position. In view of the high similarity between tobacco and other biomasses in the compositions and pyrolysis behaviors, this new protocol, which couples the TGA data with SVM algorithm, can potentially be extrapolated to the auto-classification of other biomass types.

Keywords: Machine learning; SVM algorithm; Thermogravimetric analysis; Tobacco.