pBACode: a random-barcode-based high-throughput approach for BAC paired-end sequencing and physical clone mapping

Nucleic Acids Res. 2017 Apr 20;45(7):e52. doi: 10.1093/nar/gkw1261.

Abstract

Applications that use Bacterial Artificial Chromosome (BAC) libraries often require paired-end sequences and knowledge of the physical location of each clone in plates. To facilitate obtaining this information in high-throughput, we generated pBACode vectors: a pool of BAC cloning vectors, each with a pair of random barcodes flanking its cloning site. In a pBACode BAC library, the BAC ends and their linked barcodes can be sequenced in bulk. Barcode pairs are determined by sequencing the empty pBACode vectors, which allows BAC ends to be paired according to their barcodes. For physical clone mapping, the barcodes are used as unique markers for their linked genomic sequence. After multi-dimensional pooling of BAC clones, the barcodes are sequenced and deconvoluted to locate each clone. We generated a pBACode library of 94,464 clones for the flounder Paralichthys olivaceus and obtained paired-end sequence from 95.4% of the clones. Incorporating BAC paired-ends into the genome preassembly improved its continuity by over 10-fold. Furthermore, we were able to use the barcodes to map the physical locations of each clone in just 50 pools, with up to 11 808 clones per pool. Our physical clone mapping located 90.2% of BAC clones, enabling targeted characterization of chromosomal rearrangements.

Publication types

  • Validation Study

MeSH terms

  • Animals
  • Chromosomes, Artificial, Bacterial*
  • Cloning, Molecular*
  • Flounder / genetics
  • Gene Library
  • Genome
  • High-Throughput Nucleotide Sequencing / methods*
  • Physical Chromosome Mapping / methods*
  • Saccharomyces cerevisiae / genetics
  • Sequence Analysis, DNA / methods*