Improving Predictive Accuracy in Elections

Big Data. 2017 Dec;5(4):325-336. doi: 10.1089/big.2017.0047.

Abstract

This article considers the problem of accurately predicting vote counts in elections. Typically, small-sample polls are used to estimate or predict election outcomes. This study proposes a hybrid machine-learning approach that combines multiple static data sources, such as voter registration data, with dynamic data sources, such as polls and donor data, to develop an individualized voter score for each member of the population. These voter scores are then used to estimate expected vote counts under different turnout scenarios. The proposed technique was tested with data collected during U.S. Senate and Louisiana gubernatorial elections. The expected vote counts, predicted several days before the actual election, were accurate to within 1%.
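The abstract does not specify how voter scores are turned into expected vote counts, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes each voter carries two hypothetical quantities, a `support_score` (estimated probability of voting for a given candidate) and a `turnout_propensity` (baseline probability of voting), and that a turnout scenario can be modeled as a simple multiplier on that propensity. The names `Voter`, `expected_votes`, and `turnout_multiplier` are illustrative and do not come from the article.

```python
from dataclasses import dataclass


@dataclass
class Voter:
    # Hypothetical fields; the article does not define the score format.
    support_score: float       # assumed P(votes for the candidate | votes)
    turnout_propensity: float  # assumed baseline P(votes at all)


def expected_votes(voters, turnout_multiplier=1.0):
    """Return (expected votes for the candidate, expected ballots cast).

    A turnout scenario is modeled, as an assumption, by scaling each
    voter's turnout propensity and clamping the result to [0, 1].
    """
    votes_for = 0.0
    ballots_cast = 0.0
    for voter in voters:
        p_turnout = voter.turnout_propensity * turnout_multiplier
        p_turnout = min(1.0, max(0.0, p_turnout))
        ballots_cast += p_turnout
        votes_for += p_turnout * voter.support_score
    return votes_for, ballots_cast


if __name__ == "__main__":
    # Made-up scores for illustration only, not data from the study.
    electorate = [Voter(0.9, 0.8), Voter(0.2, 0.5), Voter(0.6, 0.3)]
    scenarios = {"low turnout": 0.8, "baseline": 1.0, "high turnout": 1.2}
    for name, multiplier in scenarios.items():
        votes_for, ballots_cast = expected_votes(electorate, multiplier)
        share = votes_for / ballots_cast if ballots_cast else 0.0
        print(f"{name}: {votes_for:.2f} of {ballots_cast:.2f} ballots "
              f"({share:.1%})")
```

Summing per-voter probabilities gives an expected count rather than a single predicted outcome, which is one simple way to compare turnout scenarios side by side. A real system would presumably learn both scores from the static and dynamic data sources the abstract mentions.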

Keywords: behavioral analytics; computational social sciences; data science; machine learning; political big data; predict election outcomes; predictive analytics; voter scores.

MeSH terms

  • Algorithms
  • Computer Simulation
  • Humans
  • Machine Learning
  • Politics*
  • United States