CAVES: A Novel Tool for Comparative Analysis of Variant Epitope Sequences

Viruses. 2022 May 26;14(6):1152. doi: 10.3390/v14061152.

Abstract

In silico methods for immune epitope prediction have become essential for vaccine and therapeutic design, but manual intra-species comparison of putative epitopes remains challenging and subject to human error. Created initially for analyzing SARS-CoV-2 variants of concern, comparative analysis of variant epitope sequences (CAVES) is a novel tool designed to carry out rapid comparative analyses of epitopes amongst closely related pathogens, substantially reducing the required time and user workload. CAVES applies a two-level analysis approach. The Level-one (L1) analysis compares two epitope prediction files, and the Level-two (L2) analysis incorporates search results from the IEDB database of experimentally confirmed epitopes. Both L1 and L2 analyses sort epitopes into categories of exact matches, partial matches, or novel epitopes based on the degree to which they match with peptides from the compared file. Furthermore, CAVES uses positional sequence data to improve its accuracy and speed, taking only a fraction of the time required by manual analyses and minimizing human error. CAVES is widely applicable for evolutionary analyses and antigenic comparisons of any closely related pathogen species. CAVES is open-source software that runs through a graphical user interface on Windows operating systems, making it widely accessible regardless of coding expertise. The CAVES source code and test dataset presented here are publicly available on the CAVES GitHub page.

Keywords: antigenic variation; bioinformatics; comparative genomics; computational biology; epitope prediction; evolution; sequence analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19*
  • Computational Biology / methods
  • Epitopes / genetics
  • Epitopes, T-Lymphocyte
  • Humans
  • SARS-CoV-2* / genetics
  • Software

Substances

  • Epitopes
  • Epitopes, T-Lymphocyte

Supplementary concepts

  • SARS-CoV-2 variants

Grants and funding

This research was partially funded (APC included) by the operating budget of the National Microbiology Laboratory, Public Health Agency of Canada, to which the authors are affiliated (K.L., P.S., and H.J.). K.L. is also the recipient of a research scholarship from Research Manitoba, Canada.