Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments

PLoS One. 2018 Dec 5;13(12):e0207624. doi: 10.1371/journal.pone.0207624. eCollection 2018.

Abstract

The performance of three machine learning methods (support vector regression, random forests and artificial neural network) for estimating the LAI of paddy rice was evaluated in this study. Traditional univariate regression models involving narrowband NDVI with optimized band combinations as well as linear multivariate calibration partial least squares regression models were also evaluated for comparison. A four year field-collected dataset was used to test the robustness of LAI estimation models against temporal variation. The partial least squares regression and three machine learning methods were built on the raw hyperspectral reflectance and the first derivative separately. Two different rules were used to determine the models' key parameters. The results showed that the combination of the red edge and NIR bands (766 nm and 830 nm) as well as the combination of SWIR bands (1114 nm and 1190 nm) were optimal for producing the narrowband NDVI. The models built on the first derivative spectra yielded more accurate results than the corresponding models built on the raw spectra. Properly selected model parameters resulted in comparable accuracy and robustness with the empirical optimal parameter and significantly reduced the model complexity. The machine learning methods were more accurate and robust than the VI methods and partial least squares regression. When validating the calibrated models against the standalone validation dataset, the VI method yielded a validation RMSE value of 1.17 for NDVI(766,830) and 1.01 for NDVI(1114,1190), while the best models for the partial least squares, support vector machine and artificial neural network methods yielded validation RMSE values of 0.84, 0.82, 0.67 and 0.84, respectively. The RF models built on the first derivative spectra with mtry = 10 showed the highest potential for estimating the LAI of paddy rice.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China
  • Ecological Parameter Monitoring / methods*
  • Least-Squares Analysis
  • Linear Models
  • Machine Learning
  • Oryza / growth & development
  • Plant Leaves / growth & development*
  • Spectrum Analysis / methods*
  • Support Vector Machine

Grants and funding

This work is supported by the Natural Science Foundation of China (41701398 to F.L.), the Hi-Tech Research and Development Program (863) of China (2013AA102401-2 to Q.C.) and the Fundamental Research Funds for the Central Universities (2452017108 to F.L.). We acknowledge the support given by the Institute of Agricultural Resources and Environment, Ningxia Academy of Agro-forestry science, Yinchuan. The funder provided support in the form of salaries for authors, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.