Divergence and evolution of cotton bHLH proteins from diploid to allotetraploid

BMC Genomics. 2018 Feb 23;19(1):162. doi: 10.1186/s12864-018-4543-y.

Abstract

Background: Polyploidy is considered a major driving force in genome expansion, yielding duplicated genes whose expression may be conserved or divergence as a consequence of polyploidization.

Results: We compared the genome sequences of tetraploid cotton (Gossypium hirsutum) and its two diploid progenitors, G. arboreum and G. raimondii, and found that the bHLH genes were conserved over the polyploidization. Oppositely, the expression of the homeolgous gene pairs was diversified. The biased homeologous proportion for bHLH family is significantly higher (64.6%) than the genome wide homeologous expression bias (40%). Compared with cacao (T. cacao), orthologous genes only accounted for a small proportion (41.7%) of whole cotton bHLHs family. The further Ks analysis indicated that bHLH genes underwent at least two distinct episodes of whole genome duplication: a recent duplication (1.0-60.0 million years ago, MYA, 0.005 < Ks < 0.312) and an old duplication (> 60.0 MYA, 0.312 < Ks < 3.0). The old duplication event might have played a key role in the expansion of the bHLH family. Both recent and old duplicated pairs (68.8%) showed a divergent expression profile, indicating specialized functions. The expression diversification of the duplicated genes suggested it might be a universal feature of the long-term evolution of cotton.

Conclusions: Overview of cotton bHLH proteins indicated a conserved and divergent evolution from diploids to allotetraploid. Our results provided an excellent example for studying the long-term evolution of polyploidy.

Keywords: Divergent expression; Duplicated pairs; Expansion; Orthologous genes; bHLH protein.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Basic Helix-Loop-Helix Transcription Factors / genetics*
  • Diploidy
  • Evolution, Molecular*
  • Gene Expression Regulation, Plant*
  • Gossypium / genetics*
  • Plant Proteins / genetics*
  • Polyploidy
  • Tetraploidy

Substances

  • Basic Helix-Loop-Helix Transcription Factors
  • Plant Proteins