Chromosome-Length Haplotypes with StrandPhaseR and Strand-seq

Methods Mol Biol. 2023:2590:183-200. doi: 10.1007/978-1-0716-2819-5_12.

Abstract

Dense local haplotypes can now readily be extracted from long-read or droplet-based sequence data. However, these methods struggle to combine subchromosomal haplotype blocks into global chromosome-length haplotypes. Strand-seq is a single-cell sequencing technique that uses read orientation to capture sparse global phase information by sequencing only one of the two DNA strands of each parental homolog. In combination with dense local haplotypes from other technologies, Strand-seq data can be used to obtain complete chromosome-length phase information. In this chapter, we run the R package StrandPhaseR to phase single-nucleotide variants (SNVs) using publicly available sequence data for sample HG005 of the Genome in a Bottle project.
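
The idea of combining sparse global phase with dense local haplotype blocks can be illustrated with a toy sketch. This is not StrandPhaseR's algorithm or API; the function name `stitch_blocks`, the data layout, and the majority-vote rule are assumptions made purely for illustration. Each local block records the allele (0 or 1) carried by one of its two haplotypes at each position, and the sparse map records the allele on chromosome-wide haplotype H1 at a few positions.

```python
def stitch_blocks(blocks, sparse_h1):
    """Orient local haplotype blocks against a sparse chromosome-wide haplotype.

    blocks    : list of dicts, position -> allele (0/1) on one local haplotype
    sparse_h1 : dict, position -> allele (0/1) on global haplotype H1
    Returns a dict, position -> allele on H1, covering every block that
    could be oriented. (Hypothetical helper, illustration only.)
    """
    h1 = {}
    for block in blocks:
        # Positions where the block and the sparse global haplotype overlap
        shared = [pos for pos in block if pos in sparse_h1]
        if not shared:
            continue  # no anchoring variant: orientation stays unknown
        agree = sum(block[pos] == sparse_h1[pos] for pos in shared)
        disagree = len(shared) - agree
        if agree == disagree:
            continue  # tie: evidence is ambiguous, leave the block out
        flip = disagree > agree  # block describes H2, so invert its alleles
        for pos, allele in block.items():
            h1[pos] = 1 - allele if flip else allele
    return h1


# Two dense local blocks with unknown relative orientation
blocks = [
    {100: 0, 150: 1, 200: 0},
    {900: 1, 950: 1, 1000: 0},
]
# Sparse global phase, such as Strand-seq might provide
sparse_h1 = {150: 1, 950: 0, 1000: 1}

h1 = stitch_blocks(blocks, sparse_h1)
print(h1)
```

In this toy input the first block already matches H1 at its single anchoring position, while the second block disagrees at both anchors and is therefore flipped before being merged.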

Keywords: Genome in a Bottle; Haplotype; Phasing; Strand-seq; StrandPhaseR.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosomes*
  • Genome*
  • Haplotypes
  • High-Throughput Nucleotide Sequencing / methods
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, DNA / methods

Grants and funding