In Silico Intensive Analysis for the E4 Gene Evolution of Human Adenovirus Species D

J Microbiol. 2024 Apr 30. doi: 10.1007/s12275-024-00132-1. Online ahead of print.

Abstract

Adenovirus (Ad) is a ubiquitous pathogen capable of infecting a wide range of animals and humans. Human Adenovirus (HAdV) can cause severe infection, particularly in individuals with compromised immune systems. To date, over 110 types of HAdV have been classified into seven species from A to G, with the majority belonging to the human adenovirus species D (HAdV-D). In the HAdV-D, the most significant factor for the creation of new adenovirus types is homologous recombination between viral genes involved in determining the virus tropism or evading immune system of host cells. The E4 gene, consisting of seven Open Reading Frames (ORFs), plays a role in both the regulation of host cell metabolism and the replication of viral genes. Despite long-term studies, the function of each ORF remains unclear. Based on our updated information, ORF2, ORF3, and ORF4 have been identified as regions with relatively high mutations compared to other ORFs in the E4 gene, through the use of in silico comparative analysis. Additionally, we managed to visualize high mutation sections, previously undetectable at the DNA level, through a powerful amino acid sequence analysis tool known as proteotyping. Our research has revealed the involvement of the E4 gene in the evolution of human adenovirus, and has established accurate sequence information of the E4 gene, laying the groundwork for further research.

Keywords: E4 gene; Human adenovirus; Proteotyping; Virus evolution.