A machine-readable specification for genomics assays

bioRxiv [Preprint]. 2023 Jul 18:2023.03.17.533215. doi: 10.1101/2023.03.17.533215.

Abstract

Understanding the structure of sequenced fragments from genomics libraries is essential for accurate read preprocessing. Currently, different assays and sequencing technologies require custom scripts and programs that do not leverage the common structure of sequence elements present in genomics libraries. We present seqspec, a machine-readable specification for libraries produced by genomics assays that facilitates standardization of preprocessing and enables tracking and comparison of genomics assays. The specification and the associated seqspec command-line tool are available at https://github.com/IGVF/seqspec.
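
To illustrate the general idea of describing read structure as data rather than as assay-specific code, the following is a minimal Python sketch. It is not the seqspec format or the seqspec tool's API: the `Region` class, the `LAYOUT` list, the element names, the lengths, and the `split_read` helper are all hypothetical and chosen only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """One fixed-length sequence element in a read (hypothetical model)."""
    name: str    # illustrative label, e.g. "barcode"
    length: int  # element length in bases


# Hypothetical layout for illustration only; this is NOT the seqspec schema,
# and the element names and lengths are assumptions.
LAYOUT = [Region("barcode", 16), Region("umi", 12), Region("cdna", 10)]


def split_read(read: str, layout: list[Region]) -> dict[str, str]:
    """Slice a read into named elements according to a declared layout."""
    expected = sum(region.length for region in layout)
    if len(read) < expected:
        raise ValueError(
            f"read has {len(read)} bases, layout needs at least {expected}"
        )
    parts: dict[str, str] = {}
    pos = 0
    for region in layout:
        # Each element occupies the next `length` bases of the read.
        parts[region.name] = read[pos:pos + region.length]
        pos += region.length
    return parts


# Synthetic read built to match the hypothetical layout above.
read = "A" * 16 + "C" * 12 + "G" * 10
parts = split_read(read, LAYOUT)
```

Because the layout is plain data, the same `split_read` function could in principle serve a different assay by swapping in a different `LAYOUT`, which is the kind of reuse a shared machine-readable description is meant to enable.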

Publication types

  • Preprint