Engineered yeast genomes accurately assembled from pure and mixed samples

Nat Commun. 2021 Mar 5;12(1):1485. doi: 10.1038/s41467-021-21656-9.

Abstract

Yeast whole genome sequencing (WGS) lacks end-to-end workflows that identify genetic engineering. Here we present Prymetime, a tool that assembles yeast plasmids and chromosomes and annotates genetic engineering sequences. It is a hybrid workflow-it uses short and long reads as inputs to perform separate linear and circular assembly steps. This structure is necessary to accurately resolve genetic engineering sequences in plasmids and the genome. We show this by assembling diverse engineered yeasts, in some cases revealing unintended deletions and integrations. Furthermore, the resulting whole genomes are high quality, although the underlying assembly software does not consistently resolve highly repetitive genome features. Finally, we assemble plasmids and genome integrations from metagenomic sequencing, even with 1 engineered cell in 1000. This work is a blueprint for building WGS workflows and establishes WGS-based identification of yeast genetic engineering.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Base Sequence
  • Chromosomes
  • Chromosomes, Artificial, Yeast
  • Cloning, Molecular
  • Computer Simulation
  • Contig Mapping / methods
  • Genetic Engineering / methods*
  • Genome, Fungal*
  • Metagenome
  • Metagenomics
  • Plasmids
  • Saccharomyces cerevisiae / genetics*
  • Software
  • Transformation, Genetic
  • Whole Genome Sequencing / methods*