Triplet entropy analysis of hemagglutinin and neuraminidase sequences measures influenza virus phylodynamics

Gene. 2013 Oct 10;528(2):277-81. doi: 10.1016/j.gene.2013.06.060. Epub 2013 Jul 11.

Abstract

The influenza virus has been a challenge to science due to its ability to withstand new environmental conditions. Taking into account the development of virus sequence databases, computational approaches can be helpful to understand virus behavior over time. Furthermore, they can suggest new directions to deal with influenza. This work presents triplet entropy analysis as a potential phylodynamic tool to quantify nucleotide organization of viral sequences. The application of this measure to segments of hemagglutinin (HA) and neuraminidase (NA) of H1N1 and H3N2 virus subtypes has shown some variability effects along timeline, inferring about virus evolution. Sequences were divided by year and compared for virus subtype (H1N1 and H3N2). The nonparametric Mann-Whitney test was used for comparison between groups. Results show that differentiation in entropy precedes differentiation in GC content for both groups. Considering the HA fragment, both triplet entropy as well as GC concentration show intersection in 2009, year of the recent pandemic. Some conclusions about possible flu evolutionary lines were drawn.

Keywords: AT; Adenine and Thymine; C-phosphate-G-; CpG; Entropy; GC; Guanine and Cytosine fraction; H1N1; HA; Hemagglutinin; Influenza; NA; Neuraminidase; Power of hydrogen; RNA; Ribonucleic Acid; Sequence analysis; pH.

MeSH terms

  • Base Composition
  • Evolution, Molecular
  • Hemagglutinin Glycoproteins, Influenza Virus / genetics*
  • Humans
  • Influenza A Virus, H1N1 Subtype / genetics*
  • Influenza A Virus, H3N2 Subtype / genetics*
  • Models, Genetic
  • Neuraminidase / genetics*
  • Phylogeny
  • Sequence Analysis, DNA
  • Statistics, Nonparametric
  • Thermodynamics

Substances

  • Hemagglutinin Glycoproteins, Influenza Virus
  • Neuraminidase