AutoSeqMan: batch assembly of contigs for Sanger sequences

Zool Res. 2018 Mar 18;39(2):123-126. doi: 10.24272/j.issn.2095-8137.2018.027.

Abstract

With the wide application of DNA sequencing technology, DNA sequences are still increasingly generated through the Sanger sequencing platform. SeqMan (in the LaserGene package) is an excellent program with an easy-to-use graphical user interface (GUI) employed to assemble Sanger sequences into contigs. However, with increasing data size, larger sample sets and more sequenced loci make contig assemble complicated due to the considerable number of manual operations required to run SeqMan. Here, we present the 'autoSeqMan' software program, which can automatedly assemble contigs using SeqMan scripting language. There are two main modules available, namely, 'Classification' and 'Assembly'. Classification first undertakes preprocessing work, whereas Assembly generates a SeqMan script to consecutively assemble contigs for the classified files. Through comparison with manual operation, we showed that autoSeqMan saved substantial time in the preprocessing and assembly of Sanger sequences. We hope this tool will be useful for those with large sample sets to analyze, but with little programming experience. It is freely available at https://github.com/ Sun-Yanbo/autoSeqMan.

Keywords: Batch processing; Contig assembly; Sanger sequences; SeqMan.

MeSH terms

  • Animals
  • Autoanalysis / instrumentation
  • Autoanalysis / methods
  • Contig Mapping
  • High-Throughput Nucleotide Sequencing / instrumentation
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Sequence Analysis, DNA* / instrumentation
  • Sequence Analysis, DNA* / methods
  • User-Computer Interface

Grants and funding

The development of this package was supported by the National Natural Science Foundation of China (31671326) and the Youth Innovation Promotion Association, Chinese Academy of Sciences