[Prediction of regional soil quality based on mutual information theory integrated with decision tree algorithm]

Ying Yong Sheng Tai Xue Bao. 2012 Feb;23(2):452-8.
[Article in Chinese]

Abstract

In this paper, some main factors such as soil type, land use pattern, lithology type, topography, road, and industry type that affect soil quality were used to precisely obtain the spatial distribution characteristics of regional soil quality, mutual information theory was adopted to select the main environmental factors, and decision tree algorithm See 5.0 was applied to predict the grade of regional soil quality. The main factors affecting regional soil quality were soil type, land use, lithology type, distance to town, distance to water area, altitude, distance to road, and distance to industrial land. The prediction accuracy of the decision tree model with the variables selected by mutual information was obviously higher than that of the model with all variables, and, for the former model, whether of decision tree or of decision rule, its prediction accuracy was all higher than 80%. Based on the continuous and categorical data, the method of mutual information theory integrated with decision tree could not only reduce the number of input parameters for decision tree algorithm, but also predict and assess regional soil quality effectively.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Decision Trees*
  • Ecosystem*
  • Environmental Monitoring
  • Forecasting
  • Quality Control
  • Soil / analysis*
  • Soil Pollutants / analysis

Substances

  • Soil
  • Soil Pollutants