Combined CRISPR toolkits reveal the domestication landscape and function of the ultra-long and highly repetitive silk genes

Acta Biomater. 2023 Mar 1:158:190-202. doi: 10.1016/j.actbio.2022.12.063. Epub 2023 Jan 2.

Abstract

Highly repetitive sequences play a major structural and function role in the genome. In the present study, we developed Cas9-assisted cloning and SMRT sequencing of long repetitive sequences (CACS) to sequence and manipulate highly repetitive genes from eukaryotic genomes. CACS combined Cas9-mediated cleavage of a target segment from an intact genome, Gibson assembly cloning, and PacBio SMRT sequencing. Applying CACS, we directly cloned and sequenced the complete sequences of fibroin heavy chain (FibH) genes from 17 domesticated (Bombyx mori) and 7 wild (Bombyx mandarina) silkworms. Our analysis revealed the unique fine structure organization, genetic variations, and domestication dynamics of FibH. We also demonstrated that the length of the repetitive regions determined the mechanical properties of silk fiber, which was further confirmed by Cas9 editing of FibH. CACS is a simple, robust, and efficient approach, providing affordable accessibility to highly repetitive regions of a genome. STATEMENT OF SIGNIFICANCE: Silkworm silk is the earliest and most widely used animal fiber, and its excellent performance mainly depends on the fibroin heavy chain (FibH) protein. The FibH gene is the main breakthrough in understanding the formation mechanism and improvement of silk fiber. In the study, we developed a CACS method for characterizing the fine structure and domestication landscape of 24 silkworm FibH genes. We used CRISPR/Cas9 to edit the repetitive sequence of FibH genes, revealing the relationship between FibH genes and mechanical properties of silkworm silk. Our study is helpful in modifying silk genes to manipulate other valuable highly repetitive sequences, and provides insight for silkworm breeding.

Keywords: CRISPR/Cas9; FibH gene; Highly repetitive sequence; Mechanical properties; SMRT sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bombyx* / genetics
  • CRISPR-Cas Systems
  • Domestication
  • Fibroins* / genetics
  • Silk / genetics

Substances

  • Fibroins
  • Silk