Molecular Architecture of Early Dissemination and Massive Second Wave of the SARS-CoV-2 Virus in a Major Metropolitan Area

mBio. 2020 Oct 30;11(6):e02707-20. doi: 10.1128/mBio.02707-20.

Abstract

We sequenced the genomes of 5,085 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strains causing two coronavirus disease 2019 (COVID-19) disease waves in metropolitan Houston, TX, an ethnically diverse region with 7 million residents. The genomes were from viruses recovered in the earliest recognized phase of the pandemic in Houston and from viruses recovered in an ongoing massive second wave of infections. The virus was originally introduced into Houston many times independently. Virtually all strains in the second wave have a Gly614 amino acid replacement in the spike protein, a polymorphism that has been linked to increased transmission and infectivity. Patients infected with the Gly614 variant strains had significantly higher virus loads in the nasopharynx on initial diagnosis. We found little evidence of a significant relationship between virus genotype and altered virulence, stressing the linkage between disease severity, underlying medical conditions, and host genetics. Some regions of the spike protein, the primary target of global vaccine efforts, are replete with amino acid replacements, perhaps indicating the action of selection. We exploited the genomic data to generate defined single amino acid replacements in the receptor binding domain of spike protein that, importantly, produced decreased recognition by the neutralizing monoclonal antibody CR3022. Our report represents the first analysis of the molecular architecture of SARS-CoV-2 in two infection waves in a major metropolitan region. The findings will help us to understand the origin, composition, and trajectory of future infection waves and the potential effect of the host immune response and therapeutic maneuvers on SARS-CoV-2 evolution.

IMPORTANCE There is concern about second and subsequent waves of COVID-19 caused by the SARS-CoV-2 coronavirus occurring in communities globally that had an initial disease wave. Metropolitan Houston, TX, with a population of 7 million, is experiencing a massive second disease wave that began in late May 2020. To understand SARS-CoV-2 molecular population genomic architecture and evolution and the relationship between virus genotypes and patient features, we sequenced the genomes of 5,085 SARS-CoV-2 strains from these two waves. Our report provides the first molecular characterization of SARS-CoV-2 strains causing two distinct COVID-19 disease waves.

Keywords: COVID-19; COVID-19 disease; SARS-CoV-2; evolution; genome sequencing; molecular population genomics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Amino Acid Substitution
  • Antibodies, Neutralizing / immunology
  • Base Sequence
  • Betacoronavirus / genetics*
  • Betacoronavirus / immunology
  • COVID-19
  • COVID-19 Testing
  • Clinical Laboratory Techniques
  • Coronavirus Infections / diagnosis
  • Coronavirus Infections / epidemiology
  • Coronavirus Infections / immunology
  • Coronavirus Infections / virology*
  • Coronavirus RNA-Dependent RNA Polymerase
  • Genome, Viral
  • Genotype
  • Humans
  • Machine Learning
  • Models, Molecular
  • Molecular Diagnostic Techniques
  • Pandemics
  • Phylogeny
  • Pneumonia, Viral / epidemiology
  • Pneumonia, Viral / immunology
  • Pneumonia, Viral / virology*
  • RNA-Dependent RNA Polymerase / chemistry
  • RNA-Dependent RNA Polymerase / genetics
  • SARS-CoV-2
  • Sequence Analysis, Protein
  • Spike Glycoprotein, Coronavirus / chemistry*
  • Spike Glycoprotein, Coronavirus / genetics*
  • Spike Glycoprotein, Coronavirus / immunology
  • Texas / epidemiology
  • Viral Nonstructural Proteins / chemistry
  • Viral Nonstructural Proteins / genetics

Substances

  • Antibodies, Neutralizing
  • Spike Glycoprotein, Coronavirus
  • Viral Nonstructural Proteins
  • spike protein, SARS-CoV-2
  • Coronavirus RNA-Dependent RNA Polymerase
  • NSP12 protein, SARS-CoV-2
  • RNA-Dependent RNA Polymerase