Free fatty acid binding pocket in the locked structure of SARS-CoV-2 spike protein

Locking down the SARS-CoV-2 spike Many efforts to develop therapies against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are focused on the spike (S) protein trimer that binds to the host receptor. Structures of trimeric S protein show its receptor-binding domain in either an up or a down conformation. Toelzer et al. produced SARS-CoV-2 S in insect cells and determined the structure by cryo–electron microscopy. In their dataset, the closed form was predominant and was stabilized by binding linoleic acid, an essential fatty acid. A similar binding pocket appears to be present in previous highly pathogenic coronaviruses, and past studies suggested links between viral infection and fatty acid metabolism. The pocket could be exploited to develop inhibitors that trap S protein in the closed conformation. Science, this issue p. 725

SARS-CoV-2 has acquired functions that promote its harsh disease phenotype. SARS-CoV-2 causes severe inflammation and damage to endothelial cells in the heart, kidneys, liver, and intestines, suggestive of a vascular infection rather than a purely respiratory disease (3,4). The attachment of SARS-CoV-2 to a host cell is initiated by the spike (S) protein trimer, which adorns the outer surface of the virus, binding to its cognate receptor angiotensinconverting enzyme 2 (ACE2) with higher affin-ity than the SARS-CoV S protein (5)(6)(7). A S1-S2 polybasic furin protease cleavage site distinguishes SARS-CoV-2 from SARS-CoV and other closely related bat coronaviruses and serves to stimulate entry into host cells and cell-cell fusion (5,8,9). Inside the host cell, HCoVs remodel the lipid metabolism to facilitate virus replication (10). Infection by SARS-CoV-2 triggers an unusually impaired and dysregulated immune response (11) and a heightened inflammatory response (12), working in synergy with interferon production in the vicinity of infected cells to drive a feed-forward loop to upregulate ACE2 and further escalate infection (13).
In the search for additional functions that contribute to the pathology of infection, we determined the structure of the SARS-CoV-2 S glycoprotein by cryo-electron microscopy (cryo-EM) (Fig. 1). We produced SARS-CoV-2 S as a secreted trimer (14) in MultiBac (15) baculovirus-infected Hi5 insect cells (fig. S1) (16). Highly purified protein was used for cryo-EM data collection ( fig. S2 and table S1). After three-dimensional (3D) classification and refinement without applying symmetry (C1), we obtained a 3.0-Å closed conformation from 136,405 particles and a 3.5-Å open conformation with one receptor binding domain (RBD) in the up position from 57,990 particles (figs. S2 and S3). C3 symmetry was applied to the closed conformation particle pool, yielding a 2.85-Å map ( Fig. 1A and figs. S2 and S3).
The structure of S displays the characteristic overall shape observed for coronavirus S proteins in the closed and open conformations (17)(18)(19), with the closed form (~70%) predominating in our dataset ( Fig. 1A and figs. S2 to S4). Model building of the closed form revealed additional density in the RBDs in our structure (Fig. 1B). The tubelike shape of this density was consistent with a fatty acid, with size and shape similar to that of linoleic acid (LA) bound to other proteins ( Fig. 1B and  fig. S5) (20,21). Liquid chromatographycoupled electrospray ionization time-of-flight mass spectrometry (LC-MS) analysis confirmed the presence of a compound with the molecular weight of LA in our purified sample (Fig. 1C).
The hallmarks of free fatty acid (FFA) binding pockets in proteins are an extended "greasy" tube lined by hydrophobic amino acids, which accommodates the hydrocarbon tail, and a hydrophilic, often positively charged anchor for the acidic headgroup of the FFA. In our structure, a hydrophobic pocket mostly shaped by phenylalanines forms a bent tube into which the LA fits well (Fig. 1, D and E). The anchor for the headgroup carboxyl is provided by an arginine (Arg 408 ) and a glutamine (Gln 409 ) from the adjacent RBD in the trimer, giving rise to a composite LA binding site (Fig. 1E). We confirmed the presence of LA in all three binding pockets in the S trimer in the unsymmetrized (C1) closed structure ( fig. S6). Similarly, masked 3D classification focusing on the RBDs could not identify any unoccupied pockets ( fig. S7).
Our S construct contains alterations relative to native SARS-CoV-2 S-namely, addition of a trimerization domain and deletion of the polybasic cleavage site, neither of which alters the S conformation appreciably (14,17) (fig. S8). Glycosylation sites are located away from the LA binding pocket and are largely native in our structure (7,17) (table S2). Thus, neither glycosylation nor mutations are likely to affect the LA binding pocket. We compared S and RBD produced in insect cells with mammalianproduced S to identify any potential influence of differences in glycosylation on ACE2 binding by competition enzyme-linked immunosorbent assay (ELISA) ( Fig. 2A). All three reagents bound ACE2 efficiently. We then used size exclusion chromatography (SEC) with purified proteins to further confirm ACE2 binding by S (Fig. 2B). The LA binding pocket and the receptor binding motif (RBM) are distal and nonoverlapping (Fig. 2C). Notably, in the LAbound S, the RBM is ordered and buried at the interface between RBDs, whereas it was disordered in previously described SARS-CoV-2 S cryo-EM structures (7,17).
SARS-CoV-2 S can also adopt an open conformation ( fig. S4), which is compatible with binding ACE2. In previous apo S cryo-EM structures, about 60 to 75% of the S trimers were in the open conformation (7,17), in contrast to our observation of~70% in the closed conformation. This result could be due to LA stabilizing the closed conformation; if so, LA would be expected to reduce ACE2 binding. We performed surface plasmon resonance (SPR) experiments with biotinylated ACE2 immobilized on a streptavidin-coated chip *These authors contributed equally to this work. †Corresponding author. Email: cb14941@bristol.ac.uk (C.S.); imre.berger@bristol.ac.uk (I.B.) ( Fig. 2D and fig. S9). We first determined the dissociation constant (K D ) of the RBD-ACE2 interaction to validate our assay. Our value (26 nM; fig. S9C) is in good agreement with that of previous work [44 nM (22)], which used SPR with the RBD immobilized and ACE2 as an analyte. Apo S was prepared by applying Lipidex, the established method for removing lipids from lipid-binding proteins (23). A K D of 0.7 nM was obtained for the apo S-ACE2 interaction ( fig. S9A). For the LA-bound S-ACE2 interaction, we obtained a K D of 1.4 nM (fig. S9B). We consistently obtained a markedly reduced resonance unit signal for LA-bound S compared with apo S at the same concentrations ( Fig. 2D and fig. S9, A and B). This observation is consistent with the apo state having a higher percentage of S trimers in the open, ACE2-accessible conformation.
We characterized the affinity of the LA interaction both experimentally and computationally. Our SPR assays with immobilized RBD yielded a binding constant of~41 nM, exhibiting a slow off-rate, consistent with tight binding of LA (fig. S10). Repeated molecular dynamics simulations of the entire locked LA-bound S trimer (three 100-ns simulations) using GROMACS-2019 (24) corroborated the persistence of stable interactions between LA and the S trimer (movies S1 and S2). The affinity of LA binding to the S trimer will likely be higher than that of binding to the RBD alone, taking into account polar headgroup interactions with Arg 408 and Gln 409 of the adjacent RBD (Fig. 1E) was previously shown to affect coronavirus entry and replication (10). We administered remdesivir at concentrations of 20, 64, and 200 nM, supplementing with 50 mM LA (Fig.  2E). Our results revealed synergy, with the dose of remdesivir required to suppress SARS-CoV-2 replication markedly reduced by the addition of LA (Fig. 2, E and F). We superimposed our LA-bound structure on previous SARS-CoV-2 apo S structures in the closed conformation (7, 17) and identified a gating helix located directly at the entrance of the binding pocket (Fig. 3, A to C). This gating helix, which comprises Tyr 365 and Tyr 369 , is displaced by~6 Å when LA is bound, thus opening the pocket (Fig. 3, A and B). In the apo SARS-CoV-2 S trimer (7,17), a gap between adjacent RBDs places the hydrophilic anchor residues~10 Å from the position of the LA headgroup (Fig. 3C). Upon LA binding, the adjacent RBD in the trimer moves toward its neighbor, and the anchor residues Arg 408 and Gln 409 lock down on the headgroup of LA (Fig. 3, C and D). Overall, this results in a compaction of trimer architecture in the region formed by the three RBDs, producing a locked S structure (Fig. 3D and movie S4).
We investigated whether the LA-binding pocket is conserved in the seven coronaviruses that infect humans ( Fig. 4A and table S3). Sequence alignment shows that all residues lining the hydrophobic pocket and the anchor residues (Arg 408 and Gln 409 ) in SARS-CoV-2 are fully conserved in SARS-CoV (Fig. 4A). Structural alignment of LA-bound RBDs within the trimer of SARS-CoV-2 and apo SARS-CoV RBDs (19) reveals that the LA binding pocket is present in SARS-CoV. The greasy tube is flanked by a gating helix as in SARS-CoV-2, with Arg 395 and Gln 396 of SARS-CoV positioned 10 and 11 Å from the entrance, respectively, in a conformation that is virtually identical to that of apo SARS-CoV-2 (Figs. 3C and 4B). In MERS-CoV, the gating helix and hydrophobic residues lining the pocket are also present. Tyr 365 , Tyr 369 , and Phe 374 are substituted by likewise hydrophobic leucines and a valine, respectively (Fig. 4, A and C) (19). The Arg 408 -Gln 409 pair is not conserved; however, we identify Asn 501 , Lys 502 , and Gln 466 as potential anchor residues, located on a b sheet and an a helix within the adjacent RBD, up to 11 Å from the entrance (Fig. 4C). Thus, the greasy tube and hydrophilic anchor appear to be present in MERS-CoV, suggesting convergent evolution. In HCoV OC43, the gating helix and hydrophobic residues that line the pocket are largely conserved, whereas Tyr 365 , Tyr 369 , and Phe 374 are replaced by methionines and alanine, respectively (Fig. 4A) (18). Arg 413 is located on the same helix as Arg 408 and Gln 409 in SARS-CoV-2 and could serve as a hydrophilic anchor (Fig.  4D). No gap exists in this presumed apo form structure between the RBDs, which appear already in the locked conformation ( Fig. 4D  and fig. S11) (18). In HCoV HKU1, the hydrophobic residues are again largely conserved, but a charged residue (Glu 375 ) is positioned directly in front of the entrance, obstructing access for a putative ligand (Fig. 4E) (26). The RBDs of HCoVs 229E and NL63 adopt a very different fold ( fig. S13) (27,28), and many of the LA binding residues are not present (Fig.  4A), thus hampering predictions of a binding site for fatty acids.
In summary, we find four molecular features that mediate LA binding to SARS-CoV-2 and potentially also to SARS-CoV and MERS-CoV S proteins: a conserved hydrophobic pocket, a gating helix, amino acid residues prepositioned to interact with the LA carboxyl headgroup, and loosely packed RBDs in the apo form. By contrast, in each of the four common circulating HCoVs, it appears that one or more of these four architectural prerequisites is lacking in the S protein structures (Fig. 4 and figs. S11 and S12). down of the hydrophilic anchor and a compaction of the RBD trimer (Fig. 3, C and D). In addition to stabilizing the closed conformation, this lockdown could also help stabilize the S1 region, which includes the N-terminal domain and the RBD. The RBM, central to ACE2 binding, appears to be conformationally preorganized in our structure (Fig. 2C), indicating a generally more rigid RBD trimer when LA is bound. Although direct cross-talk between the LA binding pocket and the RBM is not apparent from our structure (Fig. 2C), the conformational changes in the RBD trimer (Fig. 3) could affect ACE2 docking and infectivity, as indicated by our SPR assays that show reduced levels of S binding in the presence of LA (Fig. 2D). The S protein's tight binding of LA originates from a well-defined size and shape complementarity afforded by the pocket (Fig. 1, B and D). The LA binding pocket thus presents a promising target for future development of small-molecule inhibitors that, for example, could irreversibly lock S in the closed conformation and interfere with receptor interactions. It is noteworthy in this context that a fatty acid binding pocket was exploited previously to develop potent small-molecule antiviral drugs to treat rhinovirus, locking viral surface proteins in a conformation incompatible with receptor binding (29,30). These antivirals were successful in human clinical trials (31,32).
A recent proteomic and metabolomic study of COVID-19 patient sera showed continuous decrease of FFAs, including LA (33). Lipid metabolome remodeling is a common element of viral infection (34,35). For coronaviruses, the LA-to-arachidonic acid metabolic pathway was identified as central to lipid remodeling (10). We hypothesize that LA sequestration by SARS-CoV-2 could confer a tissue-independent mechanism by which pathogenic coronavirus infection may drive immune dysregulation and inflammation (35)(36)(37). Our findings provide a direct structural link between LA, COVID-19 pathology, and the virus itself and suggest that both the LA binding pocket within the S protein and the multinodal LA signaling axis represent excellent therapeutic intervention points to treat SARS-CoV-2 infections.