A New Approach to Compare the Performance of Two Classification Methods of Causes of Death for Timely Surveillance in France

Stud Health Technol Inform. 2019 Aug 21:264:925-929. doi: 10.3233/SHTI190359.

Abstract

Timely mortality surveillance in France is based on the monitoring of electronic death certificates to provide information to health authorities. This study aims to analyze the performance of a rule-based and a supervised machine learning method to classify medical causes of death into 60 mortality syndromic groups (MSGs). Performance was first measured on a test set. Then we compared the trends of the monthly numbers of deaths classified into MSGs from 2012 to 2016 using both methods. Among the 60 MSGs, 31 achieved recall and precision over 0.95 for either one or the other method on the test set. On the whole dataset, the correlation coefficient of the monthly numbers of deaths obtained by the two methods were close to 1 for 21 of the 31 MSGs. This approach is useful for analyzing a large number of categories or when annotated resources are limited.

Keywords: Machine learning; cause of death; sentinel surveillance.

Publication types

  • Comparative Study

MeSH terms

  • Cause of Death*
  • Death Certificates*
  • France
  • Health Resources
  • Humans
  • Supervised Machine Learning*