SNPHunter: a bioinformatic software for single nucleotide polymorphism data acquisition and management

BMC Bioinformatics. 2005 Mar 18:6:60. doi: 10.1186/1471-2105-6-60.

Abstract

Background: Single nucleotide polymorphisms (SNPs) provide an important tool in pinpointing susceptibility genes for complex diseases and in unveiling human molecular evolution. Selection and retrieval of an optimal SNP set from publicly available databases have emerged as the foremost bottlenecks in designing large-scale linkage disequilibrium studies, particularly in case-control settings.

Results: We describe the architectural structure and implementations of a novel software program, SNPHunter, which allows for both ad hoc-mode and batch-mode SNP search, automatic SNP filtering, and retrieval of SNP data, including physical position, function class, flanking sequences at user-defined lengths, and heterozygosity from NCBI dbSNP. The SNP data extracted from dbSNP via SNPHunter can be exported and saved in plain text format for further down-stream analyses. As an illustration, we applied SNPHunter for selecting SNPs for 10 major candidate genes for type 2 diabetes, including CAPN10, FABP4, IL6, NOS3, PPARG, TNF, UCP2, CRP, ESR1, and AR.

Conclusion: SNPHunter constitutes an efficient and user-friendly tool for SNP screening, selection, and acquisition. The executable and user's manual are available at http://www.hsph.harvard.edu/ppg/software.htm

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Automation
  • Computational Biology / methods*
  • Computer Simulation
  • Database Management Systems
  • Databases, Genetic
  • Genetic Markers
  • Heterozygote
  • Humans
  • Information Storage and Retrieval
  • Internet
  • Linkage Disequilibrium
  • Models, Genetic
  • Polymorphism, Single Nucleotide*
  • Programming Languages
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Analysis, Protein
  • Software
  • Software Design
  • User-Computer Interface

Substances

  • Genetic Markers