Statistical correlation of nonconservative substitutions of HIV gp41 variable amino acid residues with the R5X4 HIV-1 phenotype

Virol J. 2016 Feb 16:13:28. doi: 10.1186/s12985-016-0486-6.

Abstract

Background: The interaction of the envelope glycoprotein of HIV-1 (gp120/gp41) with coreceptor molecules has important implications for specific cellular targeting and pathogenesis. Experimental and theoretical evidence has shown a role for gp41 in coreceptor tropism, although there is no consensus about the positions involved. Here we analyze the association of physicochemical properties of gp41 amino acid residues with viral tropism (X4, R5, and R5X4) using a large set of HIV-1 sequences. Under the assumption that conserved regions define the complex structural features essential for protein function, we focused our search only on amino acids in the gp41 variable regions.

Methods: Gp41 amino acid sequences of 2823 HIV-1 strains from all clades with known coreceptor tropism were retrieved from the Los Alamos HIV Database. Consensus sequences were constructed for homologous sequences (those obtained from the same patient and having the same tropism) to avoid bias due to sequence overrepresentation, and the variability (entropy) per site was determined. The hydropathy index (HI) and charge (Q) of amino acid residues at highly variable positions were compared between coreceptor groups using two nonparametric tests with Benjamini-Hochberg correction. Pearson's correlation analysis was performed to determine covariance of HI and Q values.
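
As an illustration of two steps named in the Methods, the following is a minimal sketch of per-site variability and multiple-testing correction. It assumes that "entropy" means Shannon entropy computed over one alignment column, which the abstract does not state explicitly. The function names `shannon_entropy` and `benjamini_hochberg` are hypothetical and are not taken from the study.

```python
import math
from collections import Counter


def shannon_entropy(column):
    """Shannon entropy (natural log) of the residues in one alignment column.

    Assumption: the abstract's per-site "entropy" is Shannon entropy.
    """
    counts = Counter(column)
    total = sum(counts.values())
    return -sum((n / total) * math.log(n / total) for n in counts.values())


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values, in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    # Toy alignment columns, not data from the study.
    print(shannon_entropy("AAAA"))  # fully conserved site
    print(shannon_entropy("ACDE"))  # highly variable site
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

A site would then be called "highly variable" when its entropy exceeds some cutoff; the cutoff used in the study is not given in the abstract.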

Results: Calculation of variability per site yielded 58 highly variable amino acid positions. Of these, statistical analysis showed significantly different HI or Q only for the R5 vs. R5X4 comparison, at twelve positions: 535, 602, 619, 636, 640, 641, 658, 662, 667, 723, 756 and 841. The largest differences in particular amino acid frequencies between coreceptor groups were found at 619, 636, 640, 641, 662, 723 and 756. A hydrophobic tendency at residues 619, 640, 641, 723 and 756, along with a hydrophilic/charged tendency at residues 636 and 662, was observed in R5X4 with respect to R5 sequences. The HI of position 640 covaried with that of 602, 619, 636, 662, and 756.
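
The covariation of HI between two positions can be illustrated with a short sketch. It assumes the Kyte-Doolittle hydropathy scale, which the abstract does not name, and it uses toy residues rather than data from the study. The names `KD`, `pearson`, and `hydropathy_correlation` are hypothetical.

```python
import math

# Kyte-Doolittle hydropathy values. Assumed scale: the abstract does not
# say which hydropathy index was used.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def hydropathy_correlation(residues_a, residues_b):
    """Correlate HI at two positions; index i is the same sequence in both."""
    hi_a = [KD[r] for r in residues_a]
    hi_b = [KD[r] for r in residues_b]
    return pearson(hi_a, hi_b)


if __name__ == "__main__":
    # Toy residues at two positions across three sequences (illustrative only).
    print(hydropathy_correlation("IVK", "LAR"))
```

A coefficient near 1 means the two positions tend to become more hydrophobic (or more hydrophilic) together across sequences; a value near -1 means they change in opposite directions.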

Conclusions: The variability and the significant correlations of physicochemical properties with viral phenotype suggest that substitutions at residues in the loop (602 and 619), the HR2 (636, 640, 641, 662), and the C-terminal tail (723, 756) of gp41 may contribute to the phenotype of R5X4 strains.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Amino Acid Substitution*
  • Amino Acids
  • Genetic Variation*
  • HIV Envelope Protein gp120 / genetics
  • HIV Envelope Protein gp120 / metabolism
  • HIV Envelope Protein gp41 / chemistry
  • HIV Envelope Protein gp41 / genetics*
  • HIV Envelope Protein gp41 / metabolism
  • HIV Infections / metabolism
  • HIV Infections / virology*
  • HIV-1 / classification*
  • HIV-1 / physiology*
  • Humans
  • Phenotype
  • Receptors, CXCR4 / genetics*
  • Receptors, CXCR4 / metabolism
  • Receptors, CCR5 / genetics*
  • Receptors, CCR5 / metabolism
  • Viral Tropism

Substances

  • Amino Acids
  • HIV Envelope Protein gp120
  • HIV Envelope Protein gp41
  • Receptors, CXCR4
  • Receptors, CCR5