One Step Closer to the Understanding of the Relationship IDR-LCR-Structure

Genes (Basel). 2023 Aug 28;14(9):1711. doi: 10.3390/genes14091711.

Abstract

Intrinsically disordered regions (IDRs) in protein sequences are emerging as functionally important elements for interaction and regulation. While being generally flexible, we previously showed, by observation of experimentally obtained structures, that they contain regions of reduced sequence complexity that have an increased propensity to form structure. Here we expand the universe of cases taking advantage of structural predictions by AlphaFold. Our studies focus on low complexity regions (LCRs) found within IDRs, where these LCRs have only one or two residue types (polyX and polyXY, respectively). In addition to confirming previous observations that polyE and polyEK have a tendency towards helical structure, we find a similar tendency for other LCRs such as polyQ and polyER, most of them including charged residues. We analyzed the position of polyXY containing IDRs within proteins, which allowed us to show that polyAG and polyAK accumulate at the N-terminal, with the latter showing increased helical propensity at that location. Functional enrichment analysis of polyXY with helical propensity indicated functions requiring interaction with RNA and DNA. Our work adds evidence of the function of LCRs in interaction-dependent structuring of disordered regions, encouraging the development of tools for the prediction of their dynamic structural properties.

Keywords: AlphaFold; homorepeats; intrinsically disordered regions; low complexity regions; polyX; polyXY; protein structure.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Protein Domains
  • RNA*

Substances

  • RNA

Grants and funding

This work received support from the following institutions: Mainz Institute of Multiscale Modeling (M3ODEL) for F.S. and M.A.A.N. for the position of M.G.K.