Machine learning-based approaches for identifying human blood cells harboring CRISPR-mediated fetal chromatin domain ablations

Sci Rep. 2022 Jan 27;12(1):1481. doi: 10.1038/s41598-022-05575-3.

Abstract

Two common hemoglobinopathies, sickle cell disease (SCD) and β-thalassemia, arise from genetic mutations within the β-globin gene. In this work, we identified a 500-bp motif (Fetal Chromatin Domain, FCD) upstream of human ϒ-globin locus and showed that the removal of this motif using CRISPR technology reactivates the expression of ϒ-globin. Next, we present two different cell morphology-based machine learning approaches that can be used identify human blood cells (KU-812) that harbor CRISPR-mediated FCD genetic modifications. Three candidate models from the first approach, which uses multilayer perceptron algorithm (MLP 20-26, MLP26-18, and MLP 30-26) and flow cytometry-derived cellular data, yielded 0.83 precision, 0.80 recall, 0.82 accuracy, and 0.90 area under the ROC (receiver operating characteristic) curve when predicting the edited cells. In comparison, the candidate model from the second approach, which uses deep learning (T2D5) and DIC microscopy-derived imaging data, performed with less accuracy (0.80) and ROC AUC (0.87). We envision that equivalent machine learning-based models can complement currently available genotyping protocols for specific genetic modifications which result in morphological changes in human cells.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Anemia, Sickle Cell / blood
  • Anemia, Sickle Cell / genetics
  • Anemia, Sickle Cell / therapy*
  • CRISPR-Cas Systems / genetics
  • Cell Line, Tumor
  • Cell Separation / methods*
  • Flow Cytometry / methods
  • Gene Editing / methods
  • Genetic Therapy / methods
  • Genotyping Techniques / methods*
  • Humans
  • Machine Learning
  • Mutation
  • Protein Domains / genetics
  • ROC Curve
  • beta-Thalassemia / blood
  • beta-Thalassemia / genetics
  • beta-Thalassemia / therapy*
  • gamma-Globins / genetics*

Substances

  • gamma-Globins