Predicting the Number of Suicides in Japan Using Internet Search Queries: Vector Autoregression Time Series Model

JMIR Public Health Surveill. 2021 Dec 3;7(12):e34016. doi: 10.2196/34016.

Abstract

Background: The number of suicides in Japan increased during the COVID-19 pandemic. Predicting the number of suicides is important to take timely preventive measures.

Objective: This study aims to clarify whether the number of suicides can be predicted by suicide-related search queries used before searching for the keyword "suicide."

Methods: This study uses the infoveillance approach for suicide in Japan by search trends in search engines. The monthly number of suicides by gender, collected and published by the National Police Agency, was used as an outcome variable. The number of searches by gender with queries associated with "suicide" on "Yahoo! JAPAN Search" from January 2016 to December 2020 was used as a predictive variable. The following five phrases highly relevant to suicide were used as search terms before searching for the keyword "suicide" and extracted and used for analyses: "abuse"; "work, don't want to go"; "company, want to quit"; "divorce"; and "no money." The augmented Dickey-Fuller and Johansen tests were performed for the original series and to verify the existence of unit roots and cointegration for each variable, respectively. The vector autoregression model was applied to predict the number of suicides. The Breusch-Godfrey Lagrangian multiplier (BG-LM) test, autoregressive conditional heteroskedasticity Lagrangian multiplier (ARCH-LM) test, and Jarque-Bera (JB) test were used to confirm model convergence. In addition, a Granger causality test was performed for each predictive variable.

Results: In the original series, unit roots were found in the trend model, whereas in the first-order difference series, both men (minimum tau 3: -9.24; max tau 3: -5.38) and women (minimum tau 3: -9.24; max tau 3: -5.38) had no unit roots for all variables. In the Johansen test, a cointegration relationship was observed among several variables. The queries used in the converged models were "divorce" for men (BG-LM test: P=.55; ARCH-LM test: P=.63; JB test: P=.66) and "no money" for women (BG-LM test: P=.17; ARCH-LM test: P=.15; JB test: P=.10). In the Granger causality test for each variable, "divorce" was significant for both men (F104=3.29; P=.04) and women (F104=3.23; P=.04).

Conclusions: The number of suicides can be predicted by search queries related to the keyword "suicide." Previous studies have reported that financial poverty and divorce are associated with suicide. The results of this study, in which search queries on "no money" and "divorce" predicted suicide, support the findings of previous studies. Further research on the economic poverty of women and those with complex problems is necessary.

Keywords: COVID-19; Japan; behavior; information seeking; infoveillance; internet; internet search engine; loneliness; mental health; model; prediction; query; suicide; suicide-related terms; time series; time series analysis; trend; vector autoregression model.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19*
  • Female
  • Humans
  • Infodemiology
  • Internet
  • Japan / epidemiology
  • Male
  • Pandemics
  • SARS-CoV-2
  • Suicide*
  • Time Factors