Predicting Cross-Species Infection of Swine Influenza Virus with Representation Learning of Amino Acid Features

Comput Math Methods Med. 2021 Oct 11:2021:6985008. doi: 10.1155/2021/6985008. eCollection 2021.

Abstract

Swine influenza viruses (SIVs) can unforeseeably cross the species barriers and directly infect humans, which pose huge challenges for public health and trigger pandemic risk at irregular intervals. Computational tools are needed to predict infection phenotype and early pandemic risk of SIVs. For this purpose, we propose a feature representation algorithm to predict cross-species infection of SIVs. We built a high-quality dataset of 1902 viruses. A feature representation learning scheme was applied to learn feature representations from 64 well-trained random forest models with multiple feature descriptors of mutant amino acid in the viral proteins, including compositional information, position-specific information, and physicochemical properties. Class and probabilistic information were integrated into the feature representations, and redundant features were removed by feature space optimization. High performance was achieved using 20 informative features and 22 probabilistic information. The proposed method will facilitate SIV characterization of transmission phenotype.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Amino Acids / analysis
  • Amino Acids / genetics
  • Animals
  • Computational Biology
  • Host Specificity
  • Humans
  • Influenza A Virus, H1N1 Subtype / genetics
  • Influenza A Virus, H1N2 Subtype / genetics
  • Influenza A Virus, H3N2 Subtype / genetics
  • Influenza A virus / classification
  • Influenza A virus / genetics*
  • Influenza A virus / pathogenicity*
  • Influenza, Human / epidemiology
  • Influenza, Human / transmission
  • Influenza, Human / virology
  • Machine Learning
  • Models, Statistical
  • Mutation
  • Orthomyxoviridae Infections / veterinary*
  • Orthomyxoviridae Infections / virology
  • Pandemics
  • Risk Factors
  • Swine
  • Swine Diseases / transmission
  • Swine Diseases / virology*
  • Viral Proteins / chemistry
  • Viral Proteins / genetics

Substances

  • Amino Acids
  • Viral Proteins