A LSTM-Hawkes hybrid model for posterior click distribution forecast in the advertising network environment

PLoS One. 2020 Jun 5;15(6):e0232887. doi: 10.1371/journal.pone.0232887. eCollection 2020.

Abstract

In the field of advertising technology, it is a key task to forecast posterior click distribution since 66% of advertising transactions depend on cost per click model. However, due to the General Data Protection Regulation, machine learning techniques to forecast posterior click distribution based on the sequences of an identified user's actions are restricted in European countries. To overcome this barrier, we introduce a contextual behavior concept for the advertising network environment and propose a new hybrid model, which we call the Long Short Term Memory-Hawkes model by combining a stochastic-based generative model and a machine learning-based predictive model. Also, to meet the computational efficiency for the heavy demand in mobile advertisement market, we define gradient exponential kernel with just three hyper parameters to minimize residuals. We have carefully tested our proposed model with production data and found that the LSTM-Hawkes model reduces the Mean Squared Error by at least 27.1% and up to 83.8% on average in comparison to the existing Hawkes Process based algorithm, Hawkes Intensity Process, as well as 39.77% on average in comparison to Multivariate Linear Regression. We have also found that our proposed model improves the forecast accuracy by about 21.2% on average.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Advertising / methods*
  • Advertising / statistics & numerical data*
  • Algorithms
  • Europe
  • Forecasting / methods*
  • Humans
  • Machine Learning
  • Neural Networks, Computer
  • Stochastic Processes

Grants and funding

This work was supported by Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIP) (No.2020-0-00107, Development of the technology to automate the recommendations for big data analytic models that define data characteristics and problems).