Visualization and quality assessment of de novo genome assemblies

Bioinformatics. 2011 Dec 15;27(24):3425-6. doi: 10.1093/bioinformatics/btr569. Epub 2011 Oct 12.

Abstract

Summary: Recent technological progress has greatly facilitated de novo genome sequencing. However, de novo assemblies consist in many pieces of contiguous sequence (contigs) arranged in thousands of scaffolds instead of small numbers of chromosomes. Confirming and improving the quality of such assemblies is critical for subsequent analysis. We present a method to evaluate genome scaffolding by aligning independently obtained transcriptome sequences to the genome and visually summarizing the alignments using the Cytoscape software. Applying this method to the genome of the red fire ant Solenopsis invicta allowed us to identify inconsistencies in 7%, confirm contig order in 20% and extend 16% of scaffolds.

Contact: oksana.ribagrognuz@unil.ch; yannick.wurm@unil.ch

Availability: Scripts that generate tables for visualization in Cytoscape from FASTA sequence and scaffolding information files are publicly available at https://github.com/ksanao/TGNet.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping*
  • Computational Biology
  • Genome
  • Sequence Analysis, DNA / methods*
  • Software
  • Synteny