Statistical models for detecting differential chromatin interactions mediated by a protein

PLoS One. 2014 May 16;9(5):e97560. doi: 10.1371/journal.pone.0097560. eCollection 2014.

Abstract

Chromatin interactions mediated by a protein of interest are of great scientific interest. Recent studies show that protein-mediated chromatin interactions can have different intensities in different types of cells or in different developmental stages of a cell. Such differences can be associated with a disease or with the development of a cell. Thus, it is of great importance to detect protein-mediated chromatin interactions with different intensities in different cells. A recent molecular technique, Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET), which uses formaldehyde cross-linking and paired-end sequencing, is able to detect genome-wide chromatin interactions mediated by a protein of interest. Here we proposed two models (One-Step Model and Two-Step Model) for two sample ChIA-PET count data (one biological replicate in each sample) to identify differential chromatin interactions mediated by a protein of interest. Both models incorporate the data dependency and the extent to which a fragment pair is related to a pair of DNA loci of interest to make accurate identifications. The One-Step Model makes use of the data more efficiently but is more computationally intensive. An extensive simulation study showed that the models can detect those differentially interacted chromatins and there is a good agreement between each classification result and the truth. Application of the method to a two-sample ChIA-PET data set illustrates its utility. The two models are implemented as an R package MDM (available at http://www.stat.osu.edu/~statgen/SOFTWARE/MDM).

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Chromatin / metabolism*
  • Computer Simulation
  • Humans
  • Models, Chemical
  • Models, Statistical*
  • Proteins / metabolism*
  • Software

Substances

  • Chromatin
  • Proteins

Grants and funding

LN and SL were supported by the National Science Foundation grant DMS-1042946. GL was supported by the Fundamental Research Funds for the Central Universities [2662014PY001]. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.