A novel method leveraging time series data to improve subphenotyping and application in critically ill patients with COVID-19

Wonsuk Oh; Pushkala Jayaraman; Pranai Tandon; Udit S Chaddha; Patricia Kovatch; Alexander W Charney; Benjamin S Glicksberg; Girish N Nadkarni

doi:10.1016/j.artmed.2023.102750

A novel method leveraging time series data to improve subphenotyping and application in critically ill patients with COVID-19

Artif Intell Med. 2024 Feb:148:102750. doi: 10.1016/j.artmed.2023.102750. Epub 2023 Dec 20.

Authors

Wonsuk Oh¹, Pushkala Jayaraman², Pranai Tandon³, Udit S Chaddha³, Patricia Kovatch⁴, Alexander W Charney⁵, Benjamin S Glicksberg⁶, Girish N Nadkarni⁷

Affiliations

¹ Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA. Electronic address: wonsuk.oh@mssm.edu.
² Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
³ Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁴ Department of Scientific Computing, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁵ Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁶ Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Character Biosciences, New York, NY, USA.
⁷ Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA. Electronic address: girish.nadkarni@mountsinai.org.

PMID: 38325922
PMCID: PMC10864255 (available on 2025-02-01)
DOI: 10.1016/j.artmed.2023.102750

Abstract

Computational subphenotyping, a data-driven approach to understanding disease subtypes, is a prominent topic in medical research. Numerous ongoing studies are dedicated to developing advanced computational subphenotyping methods for cross-sectional data. However, the potential of time-series data has been underexplored until now. Here, we propose a Multivariate Levenshtein Distance (MLD) that can account for address correlation in multiple discrete features over time-series data. Our algorithm has two distinct components: it integrates an optimal threshold score to enhance the sensitivity in discriminating between pairs of instances, and the MLD itself. We have applied the proposed distance metrics on the k-means clustering algorithm to derive temporal subphenotypes from time-series data of biomarkers and treatment administrations from 1039 critically ill patients with COVID-19 and compare its effectiveness to standard methods. In conclusion, the Multivariate Levenshtein Distance metric is a novel method to quantify the distance from multiple discrete features over time-series data and demonstrates superior clustering performance among competing time-series distance metrics.

Keywords: Covid-19; Electronic health records; Time-series distance metrics.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Algorithms
COVID-19*
Critical Illness*
Cross-Sectional Studies
Humans
Time Factors

Abstract

Publication types

MeSH terms

Grants and funding