Production of Proteins of the SARS-CoV-2 Proteome for Drug Discovery

ACS Omega. 2021 Jul 20;6(30):19983-19994. doi: 10.1021/acsomega.1c02984. eCollection 2021 Aug 3.

Abstract

The severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) is the causative agent of the coronavirus disease of 2019 (COVID-19). Its genome encodes two open reading frames for two large proteins, PP1a and PP1ab. Within the two polypeptide stretches, there are two proteases that process the large proteins into 15 discrete proteins essential for the assembly of the virion during its replication. We describe herein the cloning of the genes for these discrete proteins optimized for expression in Escherichia coli, production of the proteins, and their purification to homogeneity. These included all but six: NSP6, which possesses eight transmembrane regions, and five that are small proteins/peptides (E, ORF3b, ORF6, ORF7b, and ORF10). These proteins are intended for experimental validation of small-molecule binders as molecular template hits. The proof of concept was established with the ADP-ribosylhydrolase (ARH) domain of NSP3 in discovery of small-molecule templates that could serve as the basis for further optimization. The hit molecules include one submicromolar and a few low-micromolar binders to the ARH domain. Availability of these proteins in soluble forms opens up the opportunity for discoveries of novel templates with the potential for anti-COVID-19 pharmaceuticals.