Structures and nucleic acid-binding preferences of the eukaryotic ARID domain

Biol Chem. 2022 Feb 7;403(8-9):731-747. doi: 10.1515/hsz-2021-0404. Print 2022 Jul 26.

Abstract

The DNA-binding AT-rich interactive domain (ARID) exists in a wide range of proteins throughout eukaryotic kingdoms. ARID domain-containing proteins are involved in manifold biological processes, such as transcriptional regulation, cell cycle control and chromatin remodeling. Their individual domain composition allows for a sub-classification within higher mammals. ARID is categorized as binder of double-stranded AT-rich DNA, while recent work has suggested ARIDs as capable of binding other DNA motifs and also recognizing RNA. Despite a broad variability on the primary sequence level, ARIDs show a highly conserved fold, which consists of six α-helices and two loop regions. Interestingly, this minimal core domain is often found extended by helices at the N- and/or C-terminus with potential roles in target specificity and, subsequently function. While high-resolution structural information from various types of ARIDs has accumulated over two decades now, there is limited access to ARID-DNA complex structures. We thus find ourselves left at the beginning of understanding ARID domain target specificities and the role of accompanying domains. Here, we systematically summarize ARID domain conservation and compare the various types with a focus on their structural differences and DNA-binding preferences, including the context of multiple other motifs within ARID domain containing proteins.

Keywords: ARID containing proteins; AT-rich interacting domain; DNA-binding; DNA-protein complex; chromatin remodeling; structural biology.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • DNA / chemistry
  • DNA-Binding Proteins* / metabolism
  • Eukaryota* / metabolism
  • Mammals / genetics
  • Mammals / metabolism
  • Protein Binding
  • Sequence Alignment

Substances

  • DNA-Binding Proteins
  • DNA