Massively Parallel Assays and Quantitative Sequence-Function Relationships

Annu Rev Genomics Hum Genet. 2019 Aug 31:20:99-127. doi: 10.1146/annurev-genom-083118-014845. Epub 2019 May 15.

Abstract

Over the last decade, a rich variety of massively parallel assays have revolutionized our understanding of how biological sequences encode quantitative molecular phenotypes. These assays include deep mutational scanning, high-throughput SELEX, and massively parallel reporter assays. Here, we review these experimental methods and how the data they produce can be used to quantitatively model sequence-function relationships. In doing so, we touch on a diverse range of topics, including the identification of clinically relevant genomic variants, the modeling of transcription factor binding to DNA, the functional and evolutionary landscapes of proteins, and cis-regulatory mechanisms in both transcription and mRNA splicing. We further describe a unified conceptual framework and a core set of mathematical modeling strategies that studies in these diverse areas can make use of. Finally, we highlight key aspects of experimental design and mathematical modeling that are important for the results of such studies to be interpretable and reproducible.

Keywords: -regulatory grammar; biophysical modeling; deep learning; epistasis; genotype–phenotype map; variants of uncertain significance.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • DNA / genetics
  • DNA / metabolism
  • Epistasis, Genetic*
  • Genetic Association Studies*
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Models, Genetic*
  • Mutation
  • Phenotype
  • Protein Binding
  • RNA Splicing
  • SELEX Aptamer Technique / methods*
  • Transcription Factors / genetics
  • Transcription Factors / metabolism
  • Transcription, Genetic

Substances

  • Transcription Factors
  • DNA