Improving the genome assembly of rabbits with long-read sequencing

Genomics. 2021 Sep;113(5):3216-3223. doi: 10.1016/j.ygeno.2021.05.031. Epub 2021 May 27.

Abstract

The European rabbit (Oryctolagus cuniculus) is important as a biomedical model given its unique features in immunity and metabolism. The current reference genome OryCun2.0 established with whole-genome shotgun sequencing was quite fragmented and had not been updated for ten years. In this work, we provided a new rabbit genome assembly UM_NZW_1.0 to improve OryCun2.0 by leveraging the contig lengths based on long-read sequencing and a wealth of available Illumina paired-end sequence data. UM_NZW_1.0 showed a remarkable increase of continuity compared with OryCun2.0, with 5 times longer contig N50 and approximately 75% gaps closed. Many of the closed gaps were overlapped with protein-coding genes or transcriptional features, resulting in an enhancement of gene annotations. In particular, UM_NZW_1.0 presented a more complete landscape of the MHC region and the IGH locus, therefore provided a valuable resource for future researches on rabbits.

Keywords: Gap closing; Long-read sequencing; Rabbit genomes; Reference assembly.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • High-Throughput Nucleotide Sequencing*
  • Molecular Sequence Annotation
  • Rabbits
  • Sequence Analysis, DNA
  • Whole Genome Sequencing