Genetic polymorphism and evidence of signatures of selection in the Plasmodium falciparum circumsporozoite protein gene in Tanzanian regions with different malaria endemicity

medRxiv [Preprint]. 2024 Jan 23:2024.01.23.24301587. doi: 10.1101/2024.01.23.24301587.

Abstract

Background: In 2021 and 2023, the World Health Organization approved RTS, S/AS01 and R21/Matrix M malaria vaccines, respectively, for routine immunization of children in African countries with moderate to high transmission. These vaccines are made of Plasmodium falciparum circumsporozoite protein (Pfcsp) but polymorphisms in this gene raises concerns regarding strain-specific responses and the long-term efficacy of these vaccines. This study assessed the Pfcsp genetic diversity, population structure and signatures of selection among parasites from areas of different malaria transmission in mainland Tanzania, to generate baseline data before the introduction of the malaria vaccines in the country.

Methods: The analysis involved 589 whole genome sequences generated by and as part of the MalariaGEN Community Project. The samples were collected between 2013 and January 2015 from five regions of mainland Tanzania: Morogoro and Tanga (Muheza) (moderate transmission areas), and Kagera (Muleba), Lindi (Nachingwea), and Kigoma (Ujiji) (high transmission areas). Wright's inbreeding coefficient (Fws), Wright's fixation index (FST), principal component analysis, nucleotide diversity, and Tajima's D were used to assess within-host parasite diversity, population structure and natural selection.

Results: Based on Fws (< 0.95), there was high polyclonality (ranged from 69.23% in Nachingwea to 56.9% in Muheza). No population structure was detected in the Pfcsp gene in the five regions (mean FST= 0.0068). The average nucleotide diversity (π), nucleotide differentiation (K) and haplotype diversity (Hd) in the five regions were 4.19, 0.973 and 0.0035, respectively. The C-terminal region of Pfcsp showed high nucleotide diversity at Th2R and Th3R regions. Positive values for the Tajima's D were observed in the Th2R and Th3R regions consistent with balancing selection. The Pfcsp C-terminal sequences had 50 different haplotypes (H_1 to H_50) and only 2% of sequences matched the 3D7 strain haplotype (H_50).

Conclusions: The findings demonstrate high diversity of the Pfcsp gene with limited population differentiation. The Pfcsp gene showed positive Tajima's D values for parasite populations, consistent with balancing selection for variants within Th2R and Th3R regions. This data is consistent with other studies conducted across Africa and worldwide, which demonstrate low 3D7 haplotypes and little population structure. Therefore, additional research is warranted, incorporating other regions and more recent data to comprehensively assess trends in genetic diversity within this important gene. Such insights will inform the choice of alleles to be included in the future vaccines.

Keywords: Circumsporozoite protein; Plasmodium falciparum; Tanzania; genetic diversity; malaria vaccine; signature of selection.

Publication types

  • Preprint