Clustering and Kernel Density Estimation for Assessment of Measurable Residual Disease by Flow Cytometry

Hugues Jacqmin; Bernard Chatelain; Quentin Louveaux; Philippe Jacqmin; Jean-Michel Dogné; Carlos Graux; François Mullier

doi:10.3390/diagnostics10050317

Clustering and Kernel Density Estimation for Assessment of Measurable Residual Disease by Flow Cytometry

Diagnostics (Basel). 2020 May 18;10(5):317. doi: 10.3390/diagnostics10050317.

Authors

Hugues Jacqmin¹, Bernard Chatelain¹, Quentin Louveaux², Philippe Jacqmin³, Jean-Michel Dogné⁴, Carlos Graux⁵, François Mullier¹

Affiliations

¹ Hematology Laboratory, NAmur Research Institute for LIfe Sciences (NARILIS), Namur Thrombosis and Hemostasis Center (NTHC), CHU UCL Namur, Université catholique de Louvain, 5530 Yvoir, Belgium.
² Montefiore Institute, University of Liege, 4000 Liège, Belgium.
³ MnS-Modelling and Simulation, 5500 Dinant, Belgium.
⁴ Pharmacy Department, University of Namur, 5000 Namur, Belgium.
⁵ Department of Hematology, Namur Research Institute for Life Sciences (NARILIS), Namur Thrombosis and Hemostasis Center (NTHC), CHU UCL Namur, Université catholique de Louvain, 5530 Yvoir, Belgium.

Abstract

Standardization, data mining techniques, and comparison to normality are changing the landscape of multiparameter flow cytometry in clinical hematology. On the basis of these principles, a strategy was developed for measurable residual disease (MRD) assessment. Herein, suspicious cell clusters are first identified at diagnosis using a clustering algorithm. Subsequently, automated multidimensional spaces, named "Clouds", are created around these clusters on the basis of density calculations. This step identifies the immunophenotypic pattern of the suspicious cell clusters. Thereafter, using reference samples, the "Abnormality Ratio" (AR) of each Cloud is calculated, and major malignant Clouds are retained, known as "Leukemic Clouds" (L-Clouds). In follow-up samples, MRD is identified when more cells fall into a patient's L-Cloud compared to reference samples (AR concept). This workflow was applied on simulated data and real-life leukemia flow cytometry data. On simulated data, strong patient-dependent positive correlation (R² = 1) was observed between the AR and spiked-in leukemia cells. On real patient data, AR kinetics was in line with the clinical evolution for five out of six patients. In conclusion, we present a convenient flow cytometry data analysis approach for the follow-up of hematological malignancies. Further evaluation and validation on more patient samples and different flow cytometry panels is required before implementation in clinical practice.

Keywords: acute myeloid leukemia (AML); clustering; flow cytometry; kernel density estimation; multiparametric data analysis; personalized medicine.