Dataset of two decades of Tiger Woods press conferences and tournament performance

Data Brief. 2022 Feb 15;41:107955. doi: 10.1016/j.dib.2022.107955. eCollection 2022 Apr.

Abstract

This data article describes a dataset for exploring the determinants of superstars' sentiment in tournaments. It consists of 1,284 press conferences given by Tiger Woods on the PGA Tour between 1996 and 2020. We used natural language processing, a form of artificial intelligence, to extract the sentiment in Tiger Woods' press conferences, both before each tournament and after the rounds played, and to encode it in quantitative form. Additionally, the dataset provides variables that describe Tiger Woods' scoring and performance momentum in each round, as well as variables that describe health-related and off-the-course issues that could affect his performance on the course. These data can be useful for understanding the sentiment that superstars experience before important tournaments, their sentiment following a major victory or defeat, how that sentiment evolves throughout their athletic careers, and how sentiment is associated with performance momentum.

Keywords: Machine learning; Natural language processing; Press conferences; Sentiment analysis; Tiger Woods.

Associated data

  • figshare/10.6084/m9.figshare.16915294.v4
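
Illustrative sketch

The abstract describes encoding the sentiment of press-conference text in quantitative form but does not specify the method. As a rough illustration of what such an encoding can look like, the sketch below scores a transcript with a tiny word lexicon. The word lists, the function name sentiment_score, and the scoring formula are all assumptions made for this example; they are not the authors' pipeline, which may well rely on a trained model instead.

```python
import re

# Hypothetical toy lexicon, for illustration only. It is NOT the lexicon
# or model used to build the dataset described above.
POSITIVE = {"good", "great", "happy", "confident", "solid"}
NEGATIVE = {"bad", "poor", "frustrated", "disappointed", "tough"}


def sentiment_score(text: str) -> float:
    """Return (positive - negative) / (positive + negative) word counts.

    The result lies in [-1, 1]; 0.0 is returned when no lexicon word occurs.
    """
    # Lowercase and split into simple word tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    total = pos + neg
    return (pos - neg) / total if total else 0.0
```

Applied to each transcript, a function of this kind yields one number per press conference, which could then be set alongside round-level performance variables. A lexicon approach is only the simplest option and ignores negation and context.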