Science | March 30, 2020

Structural Modeling and Refinement of the SARS-CoV-2 S Protein

Adam Green

The novel coronavirus, SARS-CoV-2, has swept the globe and brought local economies to a near standstill as the number of cases and deaths continues to rise. As of March 30, 2020, more than 739,000 cases have been confirmed, with more than 35,000 deaths.1

Zoom into the Molecular Level

At the molecular level, the novel virus enters human cells by binding a protein receptor called ACE2. The viral component that contacts ACE2 is the spike (S) protein. The S protein binds ACE2 in a specific conformation, referred to as the ‘up’ conformation, to initiate fusion of the virus with the human cell. This mechanism of entry is analogous to that of other coronaviruses, including the virus that causes SARS (Severe Acute Respiratory Syndrome). In addition, the sequence of the SARS S protein is similar to that of the SARS-CoV-2 S protein; nevertheless, previous research shows that biotherapeutics (e.g., monoclonal antibodies) that interact with the SARS S protein do not bind the S protein of SARS-CoV-2.2

Atomic-level Details Aid Rational Drug Design

Knowledge of the precise molecular interactions between the SARS-CoV-2 S protein and ACE2 can accelerate the rational discovery and design of new therapeutics that disrupt this interaction. These new therapeutics could be novel small organic molecules or antibodies designed to form specific interactions with the S protein.

To discern molecular interactions that can advance the discovery of novel therapeutics, scientists require an atomic-level description of the target protein. Fortunately, scientists at the University of Texas and the National Institutes of Health have recently resolved and published the structure of the SARS-CoV-2 S protein.2 However, they did not generate this structure by experiment alone. Their structure determination required predictive modeling software to build an initial model, which they then fit to the experimental data. Even so, some elements, such as flexible loops and hydrogen atoms, cannot be resolved by the experimental fitting. These elements must therefore be added post hoc with predictive modeling software before rational design can begin.

Modeling Can Quickly Enhance Experimental Structures

Wrapp et al. used a program called MODELER to build a homology model as the starting point for the SARS-CoV-2 S protein structure. Homology modeling threads a protein sequence onto the existing structure of a homologous protein. For the SARS-CoV-2 sequence, they used the homologous S protein from SARS to generate the initial model, which they then fit to experimental data to resolve the structure.2 However, the resulting experimentally refined structure lacked key details, including some loops and the hydrogen atoms. Predictive modeling software makes it possible to fill in these details.
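To make the threading idea concrete, here is a minimal Python sketch of the bookkeeping involved. It is not MODELER's algorithm and not the procedure used by Wrapp et al.; it only shows how a target-template alignment decides which target residues can inherit template coordinates and which must be built as loops. The function names, the aligned sequences and the coordinates are all hypothetical placeholders, not spike protein data.

```python
# Toy illustration of the "threading" step in homology modeling.
# All sequences and coordinates are made-up placeholders (assumptions).

def thread_onto_template(target_aln, template_aln, template_coords):
    """Map each target residue onto the template residue it is aligned with.

    target_aln, template_aln: equal-length aligned strings; '-' marks a gap.
    template_coords: one (x, y, z) tuple per template residue (gaps excluded).
    Returns a list of (target_index, residue, coords_or_None).
    """
    if len(target_aln) != len(template_aln):
        raise ValueError("aligned sequences must have equal length")
    model = []
    t_idx = 0      # position in the ungapped target sequence
    tmpl_idx = 0   # position in the ungapped template sequence
    for tgt, tmpl in zip(target_aln, template_aln):
        if tgt != "-" and tmpl != "-":
            # Aligned pair: the target residue inherits the template position.
            model.append((t_idx, tgt, template_coords[tmpl_idx]))
        elif tgt != "-":
            # No template counterpart: this residue must be modeled as a loop.
            model.append((t_idx, tgt, None))
        if tgt != "-":
            t_idx += 1
        if tmpl != "-":
            tmpl_idx += 1
    return model


def sequence_identity(target_aln, template_aln):
    """Fraction of aligned (non-gap) positions with identical residues."""
    pairs = [(a, b) for a, b in zip(target_aln, template_aln)
             if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    return sum(a == b for a, b in pairs) / len(pairs)


# Hypothetical alignment: the target has a two-residue insertion ("AY").
target_aln = "MKTAYLQ-GV"
template_aln = "MKS--LQWGV"
coords = [(float(i), 0.0, 0.0) for i in range(8)]  # 8 template residues

model = thread_onto_template(target_aln, template_aln, coords)
to_build = [i for i, _, xyz in model if xyz is None]
print("target residues needing loop modeling:", to_build)  # [3, 4]
print("identity over aligned positions:", sequence_identity(target_aln, template_aln))
```

A real homology modeling program goes well beyond this mapping, for example by optimizing the model geometry after the coordinates are transferred, but the alignment still determines which regions have template support and which do not.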

The resolution limits of most experimental methods make it impossible to resolve the positions of hydrogen atoms. As a result, an experimental structure alone cannot provide a complete atomic-level description of the molecular interactions between the S protein and a potential therapeutic or the ACE2 receptor. Most predictive modeling programs offer tools for assigning hydrogen atom positions, refining interaction networks and adding or refining missing loops.3 We have rebuilt the initial model of the SARS-CoV-2 S protein from the SARS S protein superimposed onto the experimental structure, modeled the missing loops and assigned hydrogen atoms using MODELER in BIOVIA Discovery Studio (Figure 1).
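Before any refinement, it helps to check what a structure file is actually missing. The sketch below is a standard-library Python illustration, not part of Discovery Studio or MODELER. It scans the ATOM records of a PDB-format file, reports breaks in residue numbering as candidate missing loops, and counts hydrogen atoms. The helper names and the tiny demo records are hypothetical, and treating numbering breaks as missing loops is a heuristic that ignores insertion codes and deliberate numbering offsets.

```python
# Sketch: find numbering gaps (candidate missing loops) and count hydrogens
# in PDB-format ATOM records. Helper names and demo data are assumptions.

def make_atom_line(serial, name, resname, chain, resseq, element):
    """Format a minimal fixed-column ATOM record (coordinates set to zero).

    Atom-name alignment is simplified; this is only for the demo below.
    """
    return (
        f"ATOM  {serial:5d} {name:<4s} {resname:>3s} {chain}{resseq:4d}"
        + " " * 4
        + f"{0.0:8.3f}{0.0:8.3f}{0.0:8.3f}{1.0:6.2f}{0.0:6.2f}"
        + " " * 10
        + f"{element:>2s}"
    )


def summarize_pdb(pdb_text):
    """Return (gaps, hydrogen_count) for the ATOM records in pdb_text.

    gaps maps each chain ID to a list of (last_residue_before_gap,
    first_residue_after_gap) pairs.
    """
    residues = {}
    hydrogens = 0
    for line in pdb_text.splitlines():
        if not line.startswith("ATOM"):
            continue
        chain = line[21]                 # column 22: chain identifier
        resseq = int(line[22:26])        # columns 23-26: residue number
        residues.setdefault(chain, set()).add(resseq)
        element = line[76:78].strip().upper()  # columns 77-78: element
        if not element:
            # Fallback heuristic when the element column is absent:
            # use the first letter of the atom name (columns 13-16).
            element = line[12:16].strip().lstrip("0123456789")[:1].upper()
        if element == "H":
            hydrogens += 1
    gaps = {}
    for chain, numbers in residues.items():
        ordered = sorted(numbers)
        gaps[chain] = [(a, b) for a, b in zip(ordered, ordered[1:]) if b - a > 1]
    return gaps, hydrogens


# Hypothetical fragment: residues 12-14 of chain A are absent.
demo = "\n".join([
    make_atom_line(1, "CA", "ALA", "A", 10, "C"),
    make_atom_line(2, "H", "ALA", "A", 10, "H"),
    make_atom_line(3, "CA", "GLY", "A", 11, "C"),
    make_atom_line(4, "CA", "SER", "A", 15, "C"),
])
gaps, n_hydrogens = summarize_pdb(demo)
print("numbering gaps per chain:", gaps)
print("hydrogen atoms found:", n_hydrogens)
```

A hydrogen count of zero, or any reported gap, indicates that the structure needs the kind of loop building and hydrogen assignment described above before interactions can be analyzed at the atomic level.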

This offer will run through June 30, 2020.

  1. Worldometer. 17 March 2020.
  2. Wrapp, David, et al. “Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation.” Science (2020): 1260-1263.
  3. 3DS BIOVIA. Structure Based Design. 2020.
  4. Schames, Julie R, et al. “Discovery of a Novel Binding Trench in HIV Integrase.” J. Med. Chem. (2004): 1879-1881.

NOTE: This offer is limited to one license per research group. The Principal Investigator (PI) in each case should make the application.

Follow BIOVIA on LinkedIn for upcoming blog posts on other topics pertaining to BIOVIA Discovery Studio’s usefulness in COVID-19 research including protein structure determination with homology modeling, virtual screening/drug repurposing and pharmacophore modeling for lead identification.
