Whole-genome sequencing and variant discovery of Citrus reticulata "Kinnow" from Pakistan

Funct Integr Genomics. 2023 Jul 8;23(3):227. doi: 10.1007/s10142-023-01153-6.

Abstract

Citrus is a source of nutritional and medicinal benefits and is cultivated worldwide, with major groups including sweet oranges, mandarins, grapefruits, kumquats, lemons, and limes. Pakistan produces all major citrus groups, with mandarin (Citrus reticulata) being the prominent group; it includes the local commercial cultivars Feutrell's Early, Dancy, Honey, and Kinnow. The present study was designed to understand the genetic architecture of the unique Citrus reticulata variety 'Kinnow.' Whole-genome resequencing and variant calling were performed to map the genomic variability that might be responsible for its particular characteristics, such as taste, seedlessness, juice content, peel thickness, and shelf-life. A total of 139,436,350 raw sequence reads were generated, yielding 20.9 Gb of data in Fastq format with 98% effectiveness and a 0.2% base call error rate. Overall, 3,503,033 SNPs, 176,949 MNPs, 323,287 INS, and 333,083 DEL were identified using the GATK4 variant calling pipeline against the Citrus clementina reference. Furthermore, g:Profiler was applied to annotate the newly found variants, the genes/transcripts harboring them, and the pathways in which they are involved. A total of 73,864 transcripts harbor the 4,336,352 variants; most of the observed variants were predicted in non-coding regions, and 1009 transcripts were found to be well annotated by different databases. Of these transcripts, 588 are involved in biological processes, 234 in molecular functions, and 167 in cellular components. In a nutshell, 18,153 high-impact variants and 216 genic variants were found in the current study, which may be used, after functional validation, in marker-assisted breeding programs of "Kinnow" to propagate its valued traits for the improvement of contemporary citrus varieties in the region.
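The variant categories reported above can be illustrated with a minimal sketch. It assumes a simple length-based rule for labelling a REF/ALT allele pair as SNP, MNP, INS, or DEL; the abstract does not state how the categories were assigned, so this rule and the names `classify_variant` and `reported_counts` are hypothetical and are not part of GATK4. The sketch also checks that the four reported counts sum to the stated total of 4,336,352 variants, and derives the approximate mean read length implied by the reported read count and data volume.

```python
# Illustrative only: a length-based variant classifier (an assumption,
# not the GATK4 implementation) plus arithmetic on the reported figures.

def classify_variant(ref: str, alt: str) -> str:
    """Label a REF/ALT allele pair by comparing allele lengths."""
    if len(ref) == len(alt):
        # Same length: single-base change is a SNP, longer is an MNP.
        return "SNP" if len(ref) == 1 else "MNP"
    # Different length: a longer ALT is an insertion, a shorter one a deletion.
    return "INS" if len(alt) > len(ref) else "DEL"

# Counts as reported in the abstract.
reported_counts = {
    "SNP": 3_503_033,
    "MNP": 176_949,
    "INS": 323_287,
    "DEL": 333_083,
}
total_variants = sum(reported_counts.values())

# Mean read length implied by 20.9 Gb of data over 139,436,350 reads.
mean_read_length = 20.9e9 / 139_436_350

if __name__ == "__main__":
    print(classify_variant("A", "G"))    # single-base substitution
    print(classify_variant("AT", "GC"))  # multi-base substitution
    print(classify_variant("A", "ATT"))  # insertion
    print(classify_variant("ATT", "A"))  # deletion
    print(total_variants)
    print(round(mean_read_length))
```

The summed count matches the 4,336,352 variants stated for the 73,864 transcripts, and the implied read length is consistent with reads of roughly 150 bp.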

Keywords: Citrus reticulata; GATK4 pipeline; Resequencing “Kinnow”; WG variant calling.

MeSH terms

  • Citrus* / genetics
  • Genome, Plant
  • Pakistan
  • Plant Breeding
  • Sequence Analysis, DNA