Mutational Frequencies of SARS-CoV-2 Genome during the Beginning Months of the Outbreak in USA

Pathogens. 2020 Jul 13;9(7):565. doi: 10.3390/pathogens9070565.

Abstract

SARS-CoV-2 has spread very quickly from its first reported case on 19 January 2020 in the United Stated of America, leading WHO to declare pandemic by 11 March 2020. RNA viruses accumulate mutations following replication and passage in human population, which prompted us to determine the rate and the regions (hotspots) of the viral genome with high rates of mutation. We analyzed the rate of mutation accumulation over a period of 11 weeks (submitted between 19th January to 15 April 2020) in USA SARS-CoV-2 genome. Our analysis identified that majority of the viral genes accumulated mutations, although with varying rates and these included NSP2, NSP3, RdRp, helicase, Spike, ORF3a, ORF8, and Nucleocapsid protein. Sixteen mutations accumulated in Spike protein in which four mutations are located in the receptor binding domain. Intriguingly, we identified a fair number of viral proteins (NSP7, NSP9, NSP10, NSP11, Envelop, ORF6, and ORF7b proteins), which did not accumulate any mutation. Limited changes in these proteins may suggest that they have conserved functions, which are essential for virus propagation. This provides a basis for a better understanding of the genetic variation in SARS-CoV-2 circulating in the US, which could help in identifying potential therapeutic targets for controlling COVID-19.

Keywords: SARS-CoV2; bioinformatics; genome; mutation.