Machine learning in the classification of asian rust severity in soybean using hyperspectral sensor

Spectrochim Acta A Mol Biomol Spectrosc. 2024 May 15:313:124113. doi: 10.1016/j.saa.2024.124113. Epub 2024 Mar 4.

Abstract

Traditional monitoring of asian soybean rust severity is a time- and labor-intensive task, as it requires visual assessments by skilled professionals in the field. Thus, the use of remote sensing and machine learning (ML) techniques in data processing has emerged as an approach that can increase efficiency in disease monitoring, enabling faster, more accurate and time- and labor-saving evaluations. The aims of the study were: (i) to identify the spectral signature of different levels of Asian soybean rust severity; (ii) to identify the most accurate machine learning algorithm for classifying disease severity levels; (iii) which spectral input provides the highest classification accuracy for the algorithms; (iv) to determine a sample size of leaves that guarantees the best accuracy for the algorithms. A field experiment was carried out in the 2022/2023 harvest in a randomized block design with a 6x3 factorial scheme (ML algorithms x severity levels) and four replications. Disease severity levels assessed were: healthy leaves, 25 % severity, and 50 % severity. Leaf hyperspectral analysis was carried out over a wide range from 350 to 2500 nm. From this analysis, 28 spectral bands were extracted, seeking to distinguish the spectral signature for each severity level with the least input dataset. Data was subjected to machine learning analysis using Artificial Neural Network (ANN), REPTree (DT) and J48 decision trees, Random Forest (RF), and Support Vector Machine (SVM) algorithms, as well as a traditional classification method (Logistic Regression - LR). Two different input datasets were tested for each algorithm: the full spectrum (ALL) provided by the sensor and the 28 spectral bands (SB). Tests with different sample sizes were also conducted to investigate the algorithms' ability to detect severity levels with a reduced sample size. Our findings indicate differences between the spectral curves for the severity levels assessed, which makes it possible to differentiate between healthy plants with low and high severity using hyperspectral sensing. SVM was the most accurate algorithm for classifying severity levels by using all the spectral information as input. This algorithm also provided high classification accuracy when using smaller leaf samples. This study reveals that hyperspectral sensing and the use of ML algorithms provide an accurate classification of different levels of Asian rust severity, and can be powerful tools for a more efficient disease monitoring process.

Keywords: Glycine max (L.); Phytopathology; Plant disease; Remote sensing; Support vector machine.

MeSH terms

  • Algorithms
  • Basidiomycota*
  • Glycine max*
  • Machine Learning
  • Neural Networks, Computer
  • Support Vector Machine