A fully interpretable machine learning model for increasing the effectiveness of urine screening

Am J Clin Pathol. 2023 Dec 1;160(6):620-632. doi: 10.1093/ajcp/aqad099.

Abstract

Objectives: This article addresses the need for effective screening methods to identify negative urine samples before urine culture, reducing the workload, cost, and turnaround time of results in the microbiology laboratory. We aim to overcome the limitations of current solutions, which are either too simple, limiting effectiveness (1 or 2 parameters), or too complex, limiting interpretation, trust, and real-world implementation ("black box" machine learning models).

Methods: The study analyzed 15,312 samples from 10,534 patients, using clinical features and data from the Sysmex UF-1000i automated analyzer. Decision tree (DT) models with or without a lookahead strategy were used, as they offer a transparent set of logical rules that can be easily understood by medical professionals and implemented into automated analyzers.
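
To illustrate why a decision tree is considered transparent, the minimal Python sketch below writes a small tree as nested conditions. The `UrineSample` fields, the `screen_sample` function, and every threshold are hypothetical placeholders chosen for illustration; they are not the rules or cutoffs reported in the study and have no clinical validity.

```python
from dataclasses import dataclass


@dataclass
class UrineSample:
    """Hypothetical record; field names and units are illustrative only."""
    age_years: float
    bacteria_per_ul: float
    mucus_per_ul: float


def screen_sample(sample: UrineSample) -> str:
    """Return 'negative' (skip culture) or 'culture' (send to culture).

    All thresholds are invented for illustration, not taken from the study.
    """
    if sample.bacteria_per_ul < 50.0:
        if sample.age_years < 65.0:
            return "negative"
        # Older patients: also require low mucus before skipping culture.
        if sample.mucus_per_ul < 1.0:
            return "negative"
        return "culture"
    return "culture"


samples = [
    UrineSample(age_years=30, bacteria_per_ul=10, mucus_per_ul=0.5),
    UrineSample(age_years=80, bacteria_per_ul=10, mucus_per_ul=3.0),
    UrineSample(age_years=45, bacteria_per_ul=900, mucus_per_ul=0.2),
]
decisions = [screen_sample(s) for s in samples]
print(decisions)
```

Each path from the first condition to a returned label reads as a single rule, which is what allows a rule set of this kind to be reviewed by laboratory staff or encoded in analyzer software.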

Results: The best model achieved a sensitivity of 94.5% and classified negative samples based on age, bacteria, mucus, and 2 scattering parameters. The model reduced the workload by an additional 16% compared to the current procedure in the laboratory, with an estimated financial impact of €40,000/y assuming 15,000 samples/y. The identified logical rules have a scientific rationale consistent with existing knowledge in the literature.
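
As a rough aid to reading these figures, the sketch below shows how sensitivity is computed from confusion-matrix counts and what the reported workload numbers imply arithmetically. The counts passed to `sensitivity` are made up (chosen only so that the result equals 94.5%) and are not study data. The calculation also assumes that the 16% applies to the full annual volume and that the saving scales linearly with avoided cultures; neither assumption is stated in the abstract.

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Fraction of truly positive samples that the screen sends to culture."""
    return true_positives / (true_positives + false_negatives)


# Illustrative counts only (not study data): 189 / (189 + 11) = 0.945.
sens = sensitivity(true_positives=189, false_negatives=11)

# Figures quoted in the abstract.
samples_per_year = 15_000
additional_reduction = 0.16      # extra share of samples not cultured
annual_saving_eur = 40_000

# Assumption: the 16% is a share of all samples processed per year.
avoided_cultures = round(samples_per_year * additional_reduction)

# Assumption: the saving is attributable entirely to avoided cultures.
saving_per_avoided_culture = annual_saving_eur / avoided_cultures

print(f"sensitivity: {sens:.3f}")
print(f"avoided cultures per year: {avoided_cultures}")
print(f"implied saving per avoided culture: EUR {saving_per_avoided_culture:.2f}")
```

Under these assumptions, the reported figures correspond to about 2,400 fewer cultures per year.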

Conclusions: Overall, this study provides an effective and interpretable screening method for urine culture in microbiology laboratories, using data from the Sysmex UF-1000i automated analyzer. Unlike other machine learning models, our model is interpretable, generating trust and enabling real-world implementation.

Keywords: data science; decision tree; machine learning; urinalysis.

MeSH terms

  • Bacteria
  • Flow Cytometry / methods
  • Humans
  • Machine Learning
  • Urinalysis / methods
  • Urinary Tract Infections* / diagnosis
  • Urinary Tract Infections* / microbiology
  • Urinary Tract Infections* / urine