Global soil moisture data derived through machine learning trained with in-situ measurements

Sci Data. 2021 Jul 12;8(1):170. doi: 10.1038/s41597-021-00964-1.

Abstract

While soil moisture information is essential for a wide range of hydrologic and climate applications, spatially-continuous soil moisture data is only available from satellite observations or model simulations. Here we present a global, long-term dataset of soil moisture derived through machine learning trained with in-situ measurements, SoMo.ml. We train a Long Short-Term Memory (LSTM) model to extrapolate daily soil moisture dynamics in space and in time, based on in-situ data collected from more than 1,000 stations across the globe. SoMo.ml provides multi-layer soil moisture data (0-10 cm, 10-30 cm, and 30-50 cm) at 0.25° spatial and daily temporal resolution over the period 2000-2019. The performance of the resulting dataset is evaluated through cross validation and inter-comparison with existing soil moisture datasets. SoMo.ml performs especially well in terms of temporal dynamics, making it particularly useful for applications requiring time-varying soil moisture, such as anomaly detection and memory analyses. SoMo.ml complements the existing suite of modelled and satellite-based datasets given its distinct derivation, to support large-scale hydrological, meteorological, and ecological analyses.

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Environmental Monitoring
  • Humidity*
  • Machine Learning*
  • Soil*

Substances

  • Soil