Variation Patterns of NLR Clusters in Arabidopsis thaliana Genomes

Plant Commun. 2020 Jun 24;1(4):100089. doi: 10.1016/j.xplc.2020.100089. eCollection 2020 Jul 13.

Abstract

The nucleotide-binding domain and leucine-rich repeat (NLR) gene family is highly expanded in the plant lineage with extensive sequence and structure polymorphisms. To survey the landscape of NLR expansion, we mined the published long-read data generated by the resistance gene enrichment sequencing of 64 diverse Arabidopsis thaliana accessions. We found that the hot spots of massive multi-gene NLR cluster expansion did not typically span the whole cluster; instead, they were restricted to a handful of, or only one, dominant radiation(s). All sequences in such a radiation were distinct from other genes in the cluster but not from each other in the clade, making it difficult to assign trustworthy reference-based orthologies when multiple reference genes were present in the radiation. Consequently, NLR genes can be broadly divided into two types: radiating or high-fidelity, where high-fidelity genes are well conserved and well separated from other clades. A similar distinction could be made for NLR clusters, depending on whether cluster size was determined primarily by extensive radiation or the presence of numerous high-fidelity genes. We also identified groups of well-conserved NLR clades that were missing from the Columbia-0 reference genome. This suggests that the classification of NLRs using gene IDs from a single reference accession can rarely capture all major paralogs in a cluster accurately and representatively and that a reference-agnostic perspective is required to properly characterize these additional variations. Finally, we present a quantitative visualization method for differentiating these situations in a given clade of interest.

Keywords: NLR; cluster; disease resistance; evolution; phylogenetics; plant immunity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Arabidopsis Proteins / chemistry
  • Arabidopsis Proteins / genetics*
  • Arabidopsis Proteins / metabolism
  • Multigene Family / genetics*
  • NLR Proteins / chemistry
  • NLR Proteins / genetics*
  • NLR Proteins / metabolism

Substances

  • Arabidopsis Proteins
  • NLR Proteins