Rapid genome functional annotation pipeline anchored to the house sparrow (Passer domesticus, Linnaeus 1758) genome reannotation

Biol Methods Protoc. 2023 Jul 7;8(1):bpad013. doi: 10.1093/biomethods/bpad013. eCollection 2023.

Abstract

The house sparrow (Passer domesticus) is a valuable avian model for studying evolutionary genetics, development, neurobiology, physiology, behavior, and ecology, both in laboratory and field-based settings. The current annotation of the P. domesticus genome available at the Ensembl Rapid Release site is primarily focused on gene set building and lacks functional information. In this study, we present the first comprehensive functional reannotation of the P. domesticus genome using intestinal Illumina RNA sequencing (RNA-Seq) libraries. Our revised annotation provides an expanded view of the genome, encompassing 38592 transcripts compared to the current 23574 transcripts in Ensembl. We also predicted 14717 protein-coding genes, achieving 96.4% completeness for Passeriformes lineage BUSCOs. A substantial improvement in this reannotation is the accurate delineation of untranslated region (UTR) sequences. We identified 82.7% and 93.8% of the transcripts containing 5'- and 3'-UTRs, respectively. These UTR annotations are crucial for understanding post-transcriptional regulatory processes. Our findings underscore the advantages of incorporating additional specific RNA-Seq data into genome annotation, particularly when leveraging fast and efficient data processing capabilities. This functional reannotation enhances our understanding of the P. domesticus genome, providing valuable resources for future investigations in various research fields.

Keywords: Passer domesticus; RNA-Seq; annotation pipeline; birds; gene annotation; intestine.