Genome wide identification, classification and functional characterization of heat shock transcription factors in cultivated and ancestral cottons (Gossypium spp.)

Int J Biol Macromol. 2021 Jul 1:182:1507-1527. doi: 10.1016/j.ijbiomac.2021.05.016. Epub 2021 May 7.

Abstract

Heat shock transcription factors (HSF) have been demonstrated to play a significant transcriptional regulatory role in plants and considered as an integral part of signal transduction pathways against environmental stresses especially heat stress. Despite of their importance, HSFs have not yet been identified and characterized in all cotton species. In this study, we report the identification of 42, 39, 67, and 79 non-redundant HSF genes from diploid cottons G. arboreum (A2) and G. raimondii (D5), and tetraploid cottons G. barbadense (AD2) and G. hirsutum (AD1) respectively. The chromosome localization of identified HSFs revealed their random distribution on all the 13 chromosomes of A and D genomes of cotton with few regions containing HSFs in clusters. The genes structure and conserved domain analysis revealed the family-specific conservation of intron/exon organization and conserved domains in HSFs. Various abiotic stress-related cis-regulatory elements were identified from the putative promoter regions of cotton HSFs suggesting their possible role in mediating abiotic stress tolerance. The combined phylogenetic analysis of all the cotton HSFs grouped them into three subfamilies; with 145 HSFs belong to class A, 85 to class B, and 17 to class C subfamily. Moreover, a detailed analysis of HSF gene family in four species of cotton elucidated the role of allopolyploid and hybridization during evolutionary cascade of allotetraploid cotton. Comparatively, existence of more orthologous genes in cotton species than Arabidopsis, advocated that polyploidization produced new cotton specific orthologous gene clusters. Phylogenetic, collinearity and multiple synteny analyses exhibited dispersed, segmental, proximal, and tandem gene duplication events in HSF gene family. Duplication of gene events suggests that HSF gene family of cotton evolution was under strong purifying selection. Expression analysis revealed that GarHSF04 were found to be actively involved in PEG and salinity tolerance in G. arboreum. GhiHSF14 upregulated in heat and downregulated in salinity whilst almost illustrated similar behavior under cold and PEG treatments and GhiHSF21 exhibited down regulation almost across all the stresses in G. hirsutum. Overwhelmingly, present study paves the way to better understand the evolution of cotton HSF TFs and lays a foundation for future investigation of HSFs in improving abiotic stress tolerance in cotton.

Keywords: Fiber crop; Gene evolution; Gene expression; Heat stress; In-silico analysis; Phylogenetic analysis.

MeSH terms

  • Gene Expression Regulation, Plant
  • Gossypium / genetics
  • Gossypium / metabolism*
  • Heat Shock Transcription Factors / genetics
  • Heat Shock Transcription Factors / metabolism*
  • Heat-Shock Response / genetics
  • Heat-Shock Response / physiology
  • Multigene Family / genetics
  • Phylogeny
  • Plant Proteins / genetics
  • Plant Proteins / metabolism

Substances

  • Heat Shock Transcription Factors
  • Plant Proteins