Long-read-based Genome Assembly of Drosophila gunungcola Reveals Fewer Chemosensory Genes in Flower-breeding Species

Genome Biol Evol. 2023 Mar 3;15(3):evad048. doi: 10.1093/gbe/evad048.

Abstract

Drosophila gunungcola exhibits reproductive activities on the fresh flowers of several plant species and is an emerging model to study the co-option of morphological and behavioral traits in male courtship display. Here, we report a near-chromosome-level genome assembly that was constructed from long-read PacBio sequencing data (∼66× coverage) and annotated with the assistance of RNA-seq transcriptome data from whole organisms at various developmental stages. A nuclear genome of 189 Mb with 13,950 protein-coding genes and a mitogenome of 17.5 kb were obtained. Few interchromosomal rearrangements were found in synteny comparisons with Drosophila elegans, its sister species, and with Drosophila melanogaster, suggesting that the gene composition of each Muller element is evolutionarily conserved. Comparing orthologous genomic regions across species in the D. melanogaster species group revealed the loss of several odorant receptor (OR) and ionotropic receptor (IR) genes in D. gunungcola and D. elegans. This high-quality reference genome will facilitate further comparative studies on traits related to the evolution of sexual behavior and diet specialization.

Keywords: Drosophila gunungcola; PacBio sequencing; chemosensory genes; gene annotation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Drosophila melanogaster* / genetics
  • Drosophila* / genetics
  • Genome
  • Genomics
  • Molecular Sequence Annotation