Artificial intelligence exceeds humans in epidemiological job coding

Mathijs A Langezaal; Egon L van den Broek; Susan Peters; Marcel Goldberg; Grégoire Rey; Melissa C Friesen; Sarah J Locke; Nathaniel Rothman; Qing Lan; Roel C H Vermeulen

doi:10.1038/s43856-023-00397-4

Artificial intelligence exceeds humans in epidemiological job coding

Commun Med (Lond). 2023 Nov 4;3(1):160. doi: 10.1038/s43856-023-00397-4.

Authors

Affiliations

¹ Population-Based Epidemiological Cohorts Unit UMS11, INSERM, 16 Avenue Paul Vaillant Couturier, Paris, 94807, Villejuif, France. m.a.langezaal@uu.nl.
² Department of Information and Computing Sciences, Utrecht University, Princetonplein 5, Utrecht, 3584CC, Utrecht, The Netherlands. m.a.langezaal@uu.nl.
³ Department of Information and Computing Sciences, Utrecht University, Princetonplein 5, Utrecht, 3584CC, Utrecht, The Netherlands. vandenbroek@acm.org.
⁴ Institute for Risk Assessment Sciences, Utrecht University, Yalelaan 1, Utrecht, 3584CL, Utrecht, The Netherlands.
⁵ Population-Based Epidemiological Cohorts Unit UMS11, INSERM, 16 Avenue Paul Vaillant Couturier, Paris, 94807, Villejuif, France.
⁶ Center for Epidemiology on Medical Causes of Death (CépiDc), INSERM, Le Kremlin-Bicêtre, France.
⁷ Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.

Abstract

Background: Work circumstances can substantially negatively impact health. To explore this, large occupational cohorts of free-text job descriptions are manually coded and linked to exposure. Although several automatic coding tools have been developed, accurate exposure assessment is only feasible with human intervention.

Methods: We developed OPERAS, a customizable decision support system for epidemiological job coding. Using 812,522 entries, we developed and tested classification models for the Professions et Catégories Socioprofessionnelles (PCS)2003, Nomenclature d'Activités Française (NAF)2008, International Standard Classifications of Occupation (ISCO)-88, and ISCO-68. Each code comes with an estimated correctness measure to identify instances potentially requiring expert review. Here, OPERAS' decision support enables an increase in efficiency and accuracy of the coding process through code suggestions. Using the Formaldehyde, Silica, ALOHA, and DOM job-exposure matrices, we assessed the classification models' exposure assessment accuracy.

Results: We show that, using expert-coded job descriptions as gold standard, OPERAS realized a 0.66-0.84, 0.62-0.81, 0.60-0.79, and 0.57-0.78 inter-coder reliability (in Cohen's Kappa) on the first, second, third, and fourth coding levels, respectively. These exceed the respective inter-coder reliability of expert coders ranging 0.59-0.76, 0.56-0.71, 0.46-0.63, 0.40-0.56 on the same levels, enabling a 75.0-98.4% exposure assessment accuracy and an estimated 19.7-55.7% minimum workload reduction.

Conclusions: OPERAS secures a high degree of accuracy in occupational classification and exposure assessment of free-text job descriptions, substantially reducing workload. As such, OPERAS significantly outperforms both expert coders and other current coding tools. This enables large-scale, efficient, and effective exposure assessment securing healthy work conditions.

Plain language summary

Work can expose us to health risks, such as asbestos and constant noise. To study these risks, job descriptions are collected and classified by experts to standard codes. This is time-consuming, expensive, and requires expert knowledge. To improve this coding, we created computer code based on Artificial Intelligence that can both automate this process and suggest codes to experts, who can then check and change it manually if needed. Our system outperforms both expert coders and other available tools. This system could make studying occupational health risks more efficient and accurate, resulting in safer work environments.

Abstract

Plain language summary

Grants and funding