Opportunities for basic, clinical, and bioethics research at the intersection of machine learning and genomics

Cell Genom. 2024 Jan 10;4(1):100466. doi: 10.1016/j.xgen.2023.100466. Epub 2023 Dec 15.

Abstract

The data-intensive fields of genomics and machine learning (ML) are in an early stage of convergence. Genomics researchers increasingly seek to harness the power of ML methods to extract knowledge from their data; conversely, ML scientists recognize that genomics offers a wealth of large, complex, and well-annotated datasets that can be used as a substrate for developing biologically relevant algorithms and applications. The National Human Genome Research Institute (NHGRI) inquired with researchers working in these two fields to identify common challenges and receive recommendations to better support genomic research efforts using ML approaches. Those included increasing the amount and variety of training datasets by integrating genomic with multiomics, context-specific (e.g., by cell type), and social determinants of health datasets; reducing the inherent biases of training datasets; prioritizing transparency and interpretability of ML methods; and developing privacy-preserving technologies for research participants' data.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Bioethics*
  • Genomics*
  • Humans
  • Machine Learning
  • Privacy