Pattern discovery for microsatellite genome analysis

Comput Biol Med. 2014 Mar:46:71-8. doi: 10.1016/j.compbiomed.2014.01.002. Epub 2014 Jan 15.

Abstract

Microsatellite loci comprise an important part of eukaryotic genomes. Their applications in biology as genetic markers are related to numerous fields ranging from paternity analyses to construction of genetic maps and linkage to human disease. Existing software solutions which offer pattern discovery algorithms for the correct identification and downstream analysis of microsatellites are scarce and are proving to be inefficient to analyze large, exponentially increasing, sequenced genomes. Moreover, such analyses can be very difficult for bioinformatically inexperienced biologists. In this paper we present Microsatellite Genome Analysis (MiGA) software for the detection of all microsatellite loci in genomic data through a user friendly interface. The algorithm searches exhaustively and rapidly for most microsatellites. Contrary to other applications, MiGA takes into consideration the following three most important aspects: the efficiency of the algorithm, the usability of the software and the plethora of offered summary statistics. All of the above, help biologists to obtain basic quantitative and qualitative information regarding the presence of microsatellites in genomic data as well as downstream processes, such as selection of specific microsatellite loci for primer design and comparative genome analysis.

Keywords: Bioinformatics software; Genome analysis; Microsatellites; Mining methods; Pattern discovery; Simple sequence repeats.

MeSH terms

  • Databases, Nucleic Acid*
  • Genome, Human / physiology*
  • Humans
  • Microsatellite Repeats*
  • Pattern Recognition, Automated / methods*
  • Sequence Analysis, DNA / methods*
  • Software*