Systematic identification and characterization of repeat sequences in African swine fever virus genomes

Vet Res. 2022 Dec 2;53(1):101. doi: 10.1186/s13567-022-01119-9.

Abstract

African swine fever virus (ASFV) is a large DNA virus that infects domestic pigs with high morbidity and mortality rates. Repeat sequences, which are DNA sequence elements that are repeated more than twice in the genome, play an important role in the ASFV genome. The majority of repeat sequences, however, have not been identified and characterized in a systematic manner. In this study, three types of repeat sequences, including microsatellites, minisatellites and short interspersed nuclear elements (SINEs), were identified in the ASFV genome, and their distribution, structure, function, and evolutionary history were investigated. Most repeat sequences were observed in noncoding regions and at the 5' end of the genome. Noncoding repeat sequences tended to form enhancers, whereas coding repeat sequences had a lower ratio of alpha-helix and beta-sheet and a higher ratio of loop structure and surface amino acids than nonrepeat sequences. In addition, the repeat sequences tended to encode penetrating and antimicrobial peptides. Further analysis of the evolution of repeat sequences revealed that the pan-repeat sequences presented an open state, showing the diversity of repeat sequences. Finally, CpG islands were observed to be negatively correlated with repeat sequence occurrences, suggesting that they may affect the generation of repeat sequences. Overall, this study emphasizes the importance of repeat sequences in ASFVs, and these results can aid in understanding the virus's function and evolution.

Keywords: ASFV; evolution; genetic diversity; repeat sequences.

MeSH terms

  • African Swine Fever Virus* / genetics
  • Amino Acids
  • Animals
  • Antimicrobial Peptides
  • Minisatellite Repeats
  • Sus scrofa
  • Swine

Substances

  • Amino Acids
  • Antimicrobial Peptides