PSMD: An extensive database for pan-species microsatellite investigation and marker development

Mol Ecol Resour. 2020 Jan;20(1):283-291. doi: 10.1111/1755-0998.13098. Epub 2019 Oct 28.

Abstract

Microsatellites are widely distributed throughout nearly all genomes which have been extensively exploited as powerful genetic markers for diverse applications due to their high polymorphisms. Their length variations are involved in gene regulation and implicated in numerous genetic diseases even in cancers. Although much effort has been devoted in microsatellite database construction, the existing microsatellite databases still had some drawbacks, such as limited number of species, unfriendly export format, missing marker development, lack of compound microsatellites and absence of gene annotation, which seriously restricted researchers to perform downstream analysis. In order to overcome the above limitations, we developed PSMD (Pan-Species Microsatellite Database, http://big.cdu.edu.cn/psmd/) as a web-based database to facilitate researchers to easily identify microsatellites, exploit reliable molecular markers and compare microsatellite distribution pattern on genome-wide scale. In current release, PSMD comprises 678,106,741 perfect microsatellites and 43,848,943 compound microsatellites from 18,408 organisms, which covered almost all species with available genomic data. In addition to interactive browse interface, PSMD also offers a flexible filter function for users to quickly gain desired microsatellites from large data sets. PSMD allows users to export GFF3 formatted file and CSV formatted statistical file for downstream analysis. We also implemented an online tool for analysing occurrence of microsatellites with user-defined parameters. Furthermore, Primer3 was embedded to help users to design high-quality primers with customizable settings. To our knowledge, PSMD is the most extensive resource which is likely to be adopted by scientists engaged in biological, medical, environmental and agricultural research.

Keywords: database; genetic diversity; genetic marker; microsatellites; tandem repeats.

MeSH terms

  • Animals
  • Bacteria / classification
  • Bacteria / genetics
  • Databases, Genetic*
  • Eukaryota / classification
  • Eukaryota / genetics
  • Fungi / classification
  • Fungi / genetics
  • Genetic Markers
  • Microsatellite Repeats*
  • Phylogeny
  • Plants / classification
  • Plants / genetics
  • Viruses / classification
  • Viruses / genetics

Substances

  • Genetic Markers