Specificity of MLL1 and TET3 CXXC domains towards naturally occurring cytosine modifications

Biochim Biophys Acta Gene Regul Mech. 2018 Dec;1861(12):1093-1101. doi: 10.1016/j.bbagrm.2018.10.009. Epub 2018 Oct 22.

Abstract

CXXC domains have traditionally been considered as CpG specific DNA binding domains that are repelled by cytosine modifications. This view has recently been challenged by the demonstration that CXXC domain of TET3 has relaxed sequence specificity and binds with the highest affinity to symmetric DNA duplex containing 5caCpG. Here, we present a comparative analysis of the MLL1-CXXC and TET3-CXXC sequence specificity and tolerance to cytosine modifications (5-methyl, 5-hydroxymethyl, 5-formyl, 5-carboxyl) in CpG and non-CpG context. For the first time, we take into consideration possible interference from cytosine bases elsewhere in the sequence. We show that despite similar overall structure, MLL1-CXXC has greater sequence and modification specificity than TET3-CXXC. MLL1-CXXC is specific only for CpG and does not tolerate any cytosine modifications. In contrast, TET3-CXXC does not require the CpG context of cytosine bases. Methyl-, formyl- and carboxyl-modifications are tolerated by TET3-CXXC, but only preceding G. Based on our and other data we propose a parsimonious model of MLL1-CXXC and TET3-CXXC DNA binding. This model explains why the binding of modified DNA duplexes by TET3-CXXC requires in some cases a register shift and is therefore context-dependent.

Keywords: CXXC domain; CpG island (CGI); Cytosine modifications; K-specific methyltransferase (KMT2); Mixed lineage leukemia (MLL); Ten-eleven-translocation (TET).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • CpG Islands
  • Cytosine / metabolism*
  • Dioxygenases / genetics*
  • Dioxygenases / metabolism
  • Escherichia coli / genetics
  • Histone-Lysine N-Methyltransferase / genetics*
  • Histone-Lysine N-Methyltransferase / metabolism
  • Models, Molecular
  • Myeloid-Lymphoid Leukemia Protein / genetics*
  • Myeloid-Lymphoid Leukemia Protein / metabolism
  • Protein Binding
  • Protein Domains

Substances

  • KMT2A protein, human
  • Myeloid-Lymphoid Leukemia Protein
  • Cytosine
  • TET3 protein, human
  • Dioxygenases
  • Histone-Lysine N-Methyltransferase