Mapping of scaffold/matrix attachment regions in human genome: a data mining exercise

Nucleic Acids Res. 2019 Aug 22;47(14):7247-7261. doi: 10.1093/nar/gkz562.

Abstract

Scaffold/matrix attachment regions (S/MARs) are DNA elements that serve to compartmentalize the chromatin into structural and functional domains. These elements are involved in control of gene expression which governs the phenotype and also plays role in disease biology. Therefore, genome-wide understanding of these elements holds great therapeutic promise. Several attempts have been made toward identification of S/MARs in genomes of various organisms including human. However, a comprehensive genome-wide map of human S/MARs is yet not available. Toward this objective, ChIP-Seq data of 14 S/MAR binding proteins were analyzed and the binding site coordinates of these proteins were used to prepare a non-redundant S/MAR dataset of human genome. Along with co-ordinate (location) details of S/MARs, the dataset also revealed details of S/MAR features, namely, length, inter-SMAR length (the chromatin loop size), nucleotide repeats, motif abundance, chromosomal distribution and genomic context. S/MARs identified in present study and their subsequent analysis also suggests that these elements act as hotspots for integration of retroviruses. Therefore, these data will help toward better understanding of genome functioning and designing effective anti-viral therapeutics. In order to facilitate user friendly browsing and retrieval of the data obtained in present study, a web interface, MARome (http://bioinfo.net.in/MARome), has been developed.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites / genetics
  • Chromatin / genetics*
  • Chromatin / metabolism
  • Chromosome Mapping / methods
  • Computational Biology / methods
  • DNA / genetics*
  • DNA / metabolism
  • Data Mining / methods
  • Genome, Human / genetics*
  • Genomics / methods
  • Humans
  • Internet
  • Matrix Attachment Region Binding Proteins / genetics*
  • Matrix Attachment Region Binding Proteins / metabolism
  • Matrix Attachment Regions / genetics*
  • Protein Binding
  • Reproducibility of Results

Substances

  • Chromatin
  • Matrix Attachment Region Binding Proteins
  • DNA