Predictive models for charitable giving using machine learning techniques

PLoS One. 2018 Oct 3;13(10):e0203928. doi: 10.1371/journal.pone.0203928. eCollection 2018.

Abstract

Private giving represents more than three fourths of all U.S. charitable donations, about 2% of total Gross Domestic Product (GDP). Private giving is a significant factor in funding the nonprofit sector of the U.S. economy, which accounts for more than 10% of total GDP. Despite the abundance of data available through tax forms and other sources, it is unclear which factors influence private donation, and a reliable predictive mechanism remains elusive. This study aims to develop predictive models to accurately estimate future charitable giving based on a set of potentially influential factors. We have selected several factors, including unemployment rate, household income, poverty level, population, sex, age, ethnicity, education level, and number of vehicles per household. This study sheds light on the relationship between donation and these variables. We use Stepwise Regression to identify the most influential variables among the available variables, based on which predictive models are developed. Multiple Linear Regression (MLR) and machine learning techniques, including Artificial Neural Networks (ANN) and Support Vector Regression (SVR) are used to develop the predictive models. The results suggest that population, education level, and the amount of charitable giving in the previous year are the most significant, independent variables. We propose three predictive models (MLR, ANN, and SVR) and validate them using 10-fold cross-validation method, then evaluate the performance using 9 different measuring criteria. All three models are capable of predicting the amount of future donations in a given region with good accuracy. Based on the evaluation criteria, using a test data set, ANN outperforms SVR and MLR in predicting the amount of charitable giving in the following year.

Publication types

  • Validation Study

MeSH terms

  • Charities / statistics & numerical data*
  • Charities / trends*
  • Data Accuracy
  • Demography
  • Humans
  • Linear Models
  • Machine Learning*
  • Models, Economic*
  • Neural Networks, Computer
  • Socioeconomic Factors
  • United States

Grants and funding

The authors received no specific funding for this work.