An appraisal of gene targets for phylogenetic classification of canine distemper virus: Is the hemagglutinin the best candidate?

Virus Res. 2023 Feb:325:199043. doi: 10.1016/j.virusres.2023.199043. Epub 2023 Jan 10.

Abstract

Sequence analysis of the canine distemper virus (CDV) hemagglutinin (H) gene may provide important insights on virus-host interactions and has also been frequently used for CDV phylogenetic classification. Herein, we performed an in silico analysis of CDV complete genomes (CGs) available in GenBank in order to investigate the suitability of H for CDV classification into lineages/genotypes. In addition, we analyzed the other viral genes for their potential use in CDV classification. Initially, we collected 116 CDV CGs from GenBank and compared their phylogenetic classification with that of their respective H nucleotide (nt) and amino acid (aa) sequences. Subsequently, we calculated the geodesic distance between the CG and H phylogenetic trees. These analyses were later performed with other CDV genes. All CDV CGs were also evaluated for possible recombination events. Nucleotide and aa analyses of H misclassified some Vaccine/America 1/Asia 3 lineage sequences compared to CG analysis, finding supported by both Maximum Likelihood (ML) and Bayesian Markov Chain Monte Carlo (B-MCMC) methods. Moreover, aa-based H analysis showed additional disagreements with the classification obtained by CG. The geodesic distance between the H and CG trees was 0.0680. Strong recombination signals were identified in the H gene, including Vaccine/America 1/Asia 3 lineage sequences. In contrast, C and P were the only genes that fully reproduced the CG classification (by ML and/or B-MCMC) and that did not show strong recombination signals. Furthermore, the P phylogenetic tree showed the lowest geodesic distance from the CG tree (0.0369). These findings suggest C and P as potential targets for CDV phylogenetic classification, especially when full genome sequencing is not possible. Finally, since our results were obtained considering the CDV CGs available to date, future analyses performed as more CDV sequences become available will be useful to assess probable issues of H-based phylogeny and to consolidate the suitability of the C and P genes for CDV classification.

Keywords: C gene; CDV; Genotypes; Lineages; P gene.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bayes Theorem
  • Distemper Virus, Canine* / genetics
  • Distemper*
  • Dogs
  • Hemagglutinins
  • Nucleotides
  • Phylogeny

Substances

  • Hemagglutinins
  • Nucleotides