Ub-ISAP: a streamlined UNIX pipeline for mining unique viral vector integration sites from next generation sequencing data

BMC Bioinformatics. 2017 Jun 17;18(1):305. doi: 10.1186/s12859-017-1719-4.

Abstract

Background: The analysis of viral vector genomic integration sites is an important component in assessing the safety and efficiency of patient treatment using gene therapy. Alongside this clinical application, integration site identification is a key step in the genetic mapping of viral elements in mutagenesis screens that aim to elucidate gene function.

Results: We have developed a UNIX-based vector integration site analysis pipeline (Ub-ISAP) that automates integration site identification and annotation for both single-end and paired-end sequencing reads. Reads that contain viral sequences of interest are selected and aligned to the host genome, and unique integration sites are then classified as transcription start site-proximal, intragenic or intergenic.
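
The classification step described above can be illustrated with a minimal Python sketch. This is not the Ub-ISAP implementation: the `Gene` record, the function names, and in particular the `TSS_WINDOW` distance of 2,500 bp are assumptions made for illustration, since the abstract does not state how close to a transcription start site an integration must be to count as proximal.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    """Hypothetical minimal gene annotation (1-based, inclusive coordinates)."""
    name: str
    chrom: str
    start: int
    end: int
    strand: str  # "+" or "-"


# Assumed cutoff for "TSS-proximal"; the abstract does not give a value.
TSS_WINDOW = 2500


def tss(gene: Gene) -> int:
    """Transcription start site: gene start on '+' strand, gene end on '-'."""
    return gene.start if gene.strand == "+" else gene.end


def unique_sites(sites):
    """Collapse duplicate (chrom, position) integration sites."""
    return sorted(set(sites))


def classify_site(chrom: str, pos: int, genes, window: int = TSS_WINDOW) -> str:
    """Label a site as TSS-proximal, intragenic or intergenic.

    TSS proximity is checked first, so a site inside a gene but close to
    its TSS is reported as TSS-proximal (an assumed precedence).
    """
    on_chrom = [g for g in genes if g.chrom == chrom]
    if any(abs(pos - tss(g)) <= window for g in on_chrom):
        return "TSS-proximal"
    if any(g.start <= pos <= g.end for g in on_chrom):
        return "intragenic"
    return "intergenic"


if __name__ == "__main__":
    # Toy annotation and sites, invented for the example.
    genes = [
        Gene("GENE_A", "chr1", 10000, 20000, "+"),
        Gene("GENE_B", "chr1", 50000, 60000, "-"),
    ]
    sites = [("chr1", 15000), ("chr1", 15000), ("chr1", 10500), ("chr1", 30000)]
    for chrom, pos in unique_sites(sites):
        print(chrom, pos, classify_site(chrom, pos, genes))
```

In a real analysis the gene list would come from a host genome annotation, and the site coordinates from the aligned reads; a linear scan over genes is used here only to keep the sketch short.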

Conclusion: Ub-ISAP provides a reliable and efficient pipeline to generate large datasets for assessing the safety and efficiency of integrating vectors in clinical settings, with broader applications in cancer research. Ub-ISAP is available as an open-source software package at https://sourceforge.net/projects/ub-isap/

Keywords: Gene therapy; Integration site analysis; Next-generation sequencing; Viral vectors.

MeSH terms

  • Chromosome Mapping / methods*
  • Computational Biology / methods*
  • Genetic Therapy
  • Genetic Vectors / genetics*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Software*
  • Transcription Initiation Site
  • Virus Integration / genetics*