Sequence and expression analyses of KIX domain proteins suggest their importance in seed development and determination of seed size in rice, and genome stability in Arabidopsis

Mol Genet Genomics. 2013 Aug;288(7-8):329-46. doi: 10.1007/s00438-013-0753-9. Epub 2013 Jun 12.

Abstract

The KIX domain, which mediates protein-protein interactions, was first discovered as a motif in the large multidomain transcriptional activator histone acetyltransferase p300/CBP. Later, the domain was also found in Mediator subunit MED15, where it interacts with many transcription factors. In both proteins, the KIX domain is a target of activation domains of diverse transcription activators. It was found to be an essential component of several specific gene-activation pathways in fungi and metazoans. Not much is known about KIX domain proteins in plants. This study aims to characterize all the KIX domain proteins encoded by the genomes of Arabidopsis and rice. All identified KIX domain proteins are presented, together with their chromosomal locations, phylogenetic analysis, expression and SNP analyses. KIX domains were found not only in p300/CBP- and MED15-like plant proteins, but also in F-box proteins in rice and DNA helicase in Arabidopsis, suggesting roles of KIX domains in ubiquitin-mediated proteasomal degradation and genome stability. Expression analysis revealed overlapping expression of OsKIX_3, OsKIX_5 and OsKIX_7 in different stages of rice seeds development. Moreover, an association analysis of 136 in silico mined SNP loci in 23 different rice genotypes with grain-length information identified three non-synonymous SNP loci in these three rice genes showing strong association with long- and short-grain differentiation. Interestingly, these SNPs were located within KIX domain encoding sequences. Overall, this study lays a foundation for functional analysis of KIX domain proteins in plants.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Arabidopsis / genetics*
  • Base Sequence
  • Chromosome Mapping
  • Cluster Analysis
  • Gene Expression Profiling
  • Gene Order
  • Genomic Instability*
  • Molecular Sequence Data
  • Oryza / genetics*
  • Oryza / metabolism*
  • Phylogeny
  • Plant Proteins / chemistry
  • Plant Proteins / genetics*
  • Polymorphism, Single Nucleotide
  • Protein Interaction Domains and Motifs / genetics*
  • Reproducibility of Results
  • Seeds / genetics*
  • Seeds / growth & development
  • Seeds / metabolism*
  • Sequence Alignment

Substances

  • Plant Proteins