In-silico generation of high-dimensional immune response data in patients using a deep neural network

Ramin Fallahzadeh; Neda H Bidoki; Ina A Stelzer; Martin Becker; Ivana Marić; Alan L Chang; Anthony Culos; Thanaphong Phongpreecha; Maria Xenochristou; Davide De Francesco; Camilo Espinosa; Eloise Berson; Franck Verdonk; Martin S Angst; Brice Gaudilliere; Nima Aghaeepour

doi:10.1002/cyto.a.24709

Cytometry A. 2023 May;103(5):392-404. doi: 10.1002/cyto.a.24709. Epub 2022 Dec 27.

Authors

Ramin Fallahzadeh¹,², Neda H Bidoki¹,², Ina A Stelzer¹, Martin Becker¹,², Ivana Marić³, Alan L Chang¹,², Anthony Culos¹,², Thanaphong Phongpreecha¹,²,⁴, Maria Xenochristou¹,², Davide De Francesco¹,², Camilo Espinosa¹,², Eloise Berson¹,², Franck Verdonk¹, Martin S Angst¹, Brice Gaudilliere¹,³, Nima Aghaeepour¹,²,³

Affiliations

¹ Department of Anesthesiology, Pain and Perioperative Medicine, Stanford University, Stanford, California, USA.
² Department of Biomedical Data Science, Stanford University, Stanford, California, USA.
³ Department of Pediatrics, Stanford University, Stanford, California, USA.
⁴ Department of Pathology, Stanford University, Stanford, California, USA.

Abstract

Technologies for single-cell profiling of the immune system have enabled researchers to extract rich, interconnected networks of cellular abundance and of phenotypical and functional cellular parameters. These studies can power machine learning approaches to understand the role of the immune system in various diseases. However, the performance of these approaches and the generalizability of their findings have been hindered by limited cohort sizes in translational studies, partly because of the logistical demands and costs of longitudinal data collection in sufficiently large patient cohorts. An evolving challenge is that the required cohort size keeps increasing as the dimensionality of the datasets grows. We propose a deep learning model derived from a novel pipeline of optimal temporal cell matching and overcomplete autoencoders that uses data from a small subset of patients to learn to forecast an entire patient's immune response in a high-dimensional space from one timepoint to another. In our analysis of 1.08 million cells from patients pre- and post-surgical intervention, we show that the generated patient-specific data are qualitatively and quantitatively similar to real patient data in terms of fidelity, diversity, and usefulness.
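The abstract does not spell out how the temporal cell matching is computed. As a rough illustration only, the idea of optimally pairing cells across two timepoints can be sketched as an assignment problem that minimizes the total distance between matched cells. Everything in the sketch below is an assumption made for illustration and is not taken from the paper: the function name `match_cells`, the Euclidean cost, the equal cell counts, and the toy marker values. The brute-force search is only workable for a handful of cells; a real dataset would need a dedicated assignment solver.

```python
import itertools
import math


def match_cells(pre, post):
    """Pair each pre-timepoint cell with one post-timepoint cell so that
    the summed Euclidean distance over all pairs is minimal.

    Hypothetical helper: exhaustive search over permutations, so it is
    only practical for a few cells.
    """
    if len(pre) != len(post):
        raise ValueError("this sketch assumes equally many cells at both timepoints")
    best_cost, best_perm = math.inf, None
    for perm in itertools.permutations(range(len(post))):
        # perm[i] is the index of the post cell assigned to pre cell i.
        cost = sum(math.dist(pre[i], post[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost, best_perm = cost, perm
    return list(enumerate(best_perm)), best_cost


# Toy marker-expression vectors (made-up values, two markers per cell).
pre_cells = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
post_cells = [(5.2, 4.9), (0.1, 0.3), (1.1, 0.8)]

pairs, total_cost = match_cells(pre_cells, post_cells)
print(pairs, round(total_cost, 3))
```

Under this reading, each matched pair could serve as an input and target example for a network that maps a cell's state at one timepoint to its state at the next; whether the authors train their autoencoders this way is not stated in the abstract.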

Keywords: autoencoders; deep learning; generative modeling; mass cytometry; single-cell immunome; surgery.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Humans
Machine Learning*
Neural Networks, Computer*
Proteomics

Grants and funding

R35 GM138353/GM/NIGMS NIH HHS/United States