A quantitative analysis of the impact of explicit incorporation of recency, seasonality and model specialization into fine-grained tourism demand prediction models

PLoS One. 2022 Dec 8;17(12):e0278112. doi: 10.1371/journal.pone.0278112. eCollection 2022.

Abstract

Forecasting is of utmost importance for the Tourism Industry. The development of models to predict visitation demand to specific places is essential to formulate adequate tourism development plans and policies. Yet, only a handful of models deal with the hard problem of fine-grained (per attraction) tourism demand prediction. In this paper, we argue that three key requirements of this type of application should be fulfilled: (i) recency-forecasting models should consider the impact of recent events (e.g. weather change, epidemics and pandemics); (ii) seasonality-tourism behavior is inherently seasonal; and (iii) model specialization-individual attractions may have very specific idiosyncratic patterns of visitations that should be taken into account. These three key requirements should be considered explicitly and in conjunction to advance the state-of-the-art in tourism prediction models. In our experiments, considering a rich set of indoor and outdoor attractions with environmental and social data, the explicit incorporation of such requirements as features into the models improved the rate of highly accurate predictions by more than 320% when compared to the current state-of-the-art in the field. Moreover, they also help to solve very difficult prediction cases, previously poorly solved by the current models. We also investigate the performance of the models in the (simulated) scenarios in which it is impossible to fulfill all three requirements-for instance, when there is not enough historical data for an attraction to capture seasonality. All in all, the main contributions of this paper are the proposal and evaluation of a new information architecture for fine-grained tourism demand prediction models as well as a quantification of the impact of each of the three aforementioned factors on the accuracy of the learned models. Our results have both theoretical and practical implications towards solving important touristic business demands.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Policy*
  • Tourism*

Grants and funding

This research is partially funded by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPQ) and Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG). The authors and grant numbers are as following: ● Amir Khatibi: (CNPq: 169823/2017-2) ● Ana Paula Couto da Silva and Marcos A. Gonçalves: (FAPEMIG: PPM-00543-17 and PPM-00177-18), (CNPq: 422593/2018-4, 310538/2020-3, 310668/2020-4, 402711/2021-1 and 403184/2021-5) ● Jussara Almeida: (CNPq: 305683/2019-5 and 403106/2021-4) The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.