BING: biomedical informatics pipeline for Next Generation Sequencing

J Biomed Inform. 2010 Jun;43(3):428-34. doi: 10.1016/j.jbi.2009.11.003. Epub 2009 Nov 28.

Abstract

High throughput parallel genomic sequencing (Next Generation Sequencing, NGS) shifts the bottleneck in sequencing processes from experimental data production to computationally intensive informatics-based data analysis. This manuscript introduces a biomedical informatics pipeline (BING) for the analysis of NGS data that offers several novel computational approaches to 1. image alignment, 2. signal correlation, compensation, separation, and pixel-based cluster registration, 3. signal measurement and base calling, 4. quality control and accuracy measurement. These approaches address many of the informatics challenges, including image processing, computational performance, and accuracy. These new algorithms are benchmarked against the Illumina Genome Analysis Pipeline. BING is the one of the first software tools to perform pixel-based analysis of NGS data. When compared to the Illumina informatics tool, BING's pixel-based approach produces a significant increase in the number of sequence reads, while reducing the computational time per experiment and error rate (<2%). This approach has the potential of increasing the density and throughput of NGS technologies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Medical Informatics / methods*
  • Sequence Analysis, DNA / methods*
  • Software*