A chromosome-scale genome assembly of European hazel (Corylus avellana L.) reveals targets for crop improvement

Plant J. 2021 Mar;105(5):1413-1430. doi: 10.1111/tpj.15099. Epub 2021 Jan 6.

Abstract

The European hazelnut (Corylus avellana L.) is a tree crop of economic importance worldwide, but especially for northern Turkey, where the majority of production takes place. Hazelnut production is currently challenged by environmental stresses, such as a recent outbreak of severe powdery mildew disease; furthermore, allergy to hazelnuts is an increasing health concern in some regions. In order to provide a foundation for using the available hazelnut genetic resources for crop improvement, we produced a fully assembled genome sequence and annotation for a hazelnut species, from C. avellana cv. 'Tombul', one of the most important Turkish varieties. A hybrid sequencing strategy, combining short reads, long reads and proximity ligation methods, enabled us to resolve heterozygous regions and produce a high-quality 370-Mb assembly that agrees closely with cytogenetic studies and genetic maps of the 11 C. avellana chromosomes, and covers 97.8% of the estimated genome size. The genome includes 27 270 high-confidence protein-coding genes, over 20 000 of which were functionally annotated based on homology with known plant proteins. We focused particularly on gene families encoding hazelnut allergens, and the Mildew resistance Locus O (MLO) proteins that are an important susceptibility factor for powdery mildew. The complete assembly enabled us to differentiate between members of these families and to identify homologues that may be important in mildew disease and hazelnut allergy. These findings provide examples of how the genome can be used to guide research and to develop effective strategies for crop improvement in C. avellana.

Keywords: Corylus avellana; European hazel; food allergy; genome assembly; powdery mildew disease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Corylus / genetics
  • Corylus / metabolism*
  • Genome Size / genetics
  • Plant Proteins / genetics
  • Plant Proteins / metabolism*

Substances

  • Plant Proteins