SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE

 
CONTINUE READING
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
REVIEWS
                                     SARS-​CoV-2 variants, spike mutations
                                     and immune escape
                                     William T. Harvey 1,2,8, Alessandro M. Carabelli 3,8, Ben Jackson 4,
                                     Ravindra K. Gupta5, Emma C. Thomson6,7, Ewan M. Harrison3,7, Catherine Ludden3,
                                     Richard Reeve 1, Andrew Rambaut 4, COVID-19 Genomics UK (COG-​UK) Consortium*,
                                     Sharon J. Peacock3 and David L. Robertson 2 ✉
                                     Abstract | Although most mutations in the severe acute respiratory syndrome coronavirus 2
                                     (SARS-​CoV-2) genome are expected to be either deleterious and swiftly purged or relatively
                                     neutral, a small proportion will affect functional properties and may alter infectivity, disease
                                     severity or interactions with host immunity. The emergence of SARS-​CoV-2 in late 2019 was
                                     followed by a period of relative evolutionary stasis lasting about 11 months. Since late 2020,
                                     however, SARS-​CoV-2 evolution has been characterized by the emergence of sets of mutations,
                                     in the context of ‘variants of concern’, that impact virus characteristics, including transmissibility
                                     and antigenicity, probably in response to the changing immune profile of the human population.
                                     There is emerging evidence of reduced neutralization of some SARS-​CoV-2 variants by
                                     postvaccination serum; however, a greater understanding of correlates of protection is
                                     required to evaluate how this may impact vaccine effectiveness. Nonetheless, manufacturers
                                     are preparing platforms for a possible update of vaccine sequences, and it is crucial that
                                     surveillance of genetic and antigenic changes in the global virus population is done alongside
                                     experiments to elucidate the phenotypic impacts of mutations. In this Review, we summarize the
                                     literature on mutations of the SARS-​CoV-2 spike protein, the primary antigen, focusing on their
                                     impacts on antigenicity and contextualizing them in the protein structure, and discuss them in
                                     the context of observed mutation frequencies in global sequence datasets.

Variants                            As of April 2021, severe acute respiratory syndrome        high-​effect mutations that contribute to virus adaption
In the context of viruses,          coronavirus 2 (SARS-​C oV-2), the causative agent of       and fitness do occur, they tend to be in the minority
genetically distinct viruses with   COVID-19, accounted for more than 143 million infec-       compared with tolerated low-​effect or no-​effect ‘neu-
mutations different from those      tions and more than three million deaths worldwide1.       tral’ amino acid changes4. A small minority of mutations
of other viruses. ‘Variants’ can
also refer to the founding virus
                                    Virus genomic sequences are being generated and            are expected to impact virus phenotype in a way that
of a cluster or lineage and         shared at an unprecedented rate, with more than one        confers a fitness advantage, in at least some contexts.
is used to refer collectively       million SARS-​CoV-2 sequences available via the Global     Such mutations may alter various aspects of virus biol-
to the resulting variants that      Initiative on Sharing All Influenza Data (GISAID), per-    ogy, such as pathogenicity, infectivity, transmissibility
form the lineage. Variants
                                    mitting near real-​time surveillance of the unfolding      and/or antigenicity. Although care has to be taken not
with changed biological
characteristics or antigenicity
                                    pandemic2. The use of pathogen genomes on this scale       to confound mutations being merely present in grow­
have been termed ‘variants          to track the spread of the virus internationally, study    ing lineages with mutations that change virus biology5,
of interest’, ‘variants under       local outbreaks and inform public health policy signi-     fitness-​enhancing mutations were first detected to
investigation’ or ‘variants of      fies a new age in virus genomic investigations3. Further   have arisen within a few months of the evolution of
concern’ by public health
bodies.
                                    to understanding epidemiology, sequencing enables          SARS-​CoV-2 within the human population. For exam-
                                    identification of emerging SARS-​CoV-2 variants and        ple, the spike protein amino acid change D614G was
                                    sets of mutations potentially linked to changes in viral   noted to be increasing in frequency in April 2020 and to
                                    properties.                                                have emerged several times in the global SARS-​CoV-2
✉e-​mail: david.l.robertson@            As highly deleterious mutations are rapidly purged,    population, and the coding sequence exhibits a high
glasgow.ac.uk                       most mutations observed in genomes sampled from cir-       dN/dS ratio, suggesting positive selection at the codon
https://doi.org/10.1038/            culating SARS-​CoV-2 virions are expected to be either     position 614 (refs6,7). Subsequent studies indicated that
s41579-021-00573-0                  neutral or mildly deleterious. This is because although    D614G confers a moderate advantage for infectivity8,9

Nature Reviews | Microbiology

                                                                  0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

Mutations
                                      and transmissibility10. Several other spike mutations of         from infection18. Another RBM amino acid change,
The substitutions, insertions         note have now arisen and are discussed in this Review,           Y453F — associated with increased ACE2-​binding
or deletions of one or more           with particular focus on mutations affecting antigenicity.       affinity19 — received considerable attention following
nucleotides in the virus RNA              The extent to which mutations affecting the anti-            its identification in sequences associated with infec-
genome. Non-​synonymous
nucleotide substitutions in
                                      genic phenotype of SARS-​CoV-2 will enable variants              tions in humans and mink; most notably one lineage
protein-​coding sequence              to circumvent immunity conferred by natural infection            identified in Denmark and initially named ‘cluster 5’
result in a change in amino acid      or vaccination remains to be determined. However,                (now B.1.1.298)20. As of 5 November 2020, 214 humans
(referred to as a substitution        there is growing evidence that mutations that change             infected with SARS-​CoV-2 related to mink were all car-
or replacement), whereas
                                      the antigenic phenotype of SARS-​C oV-2 are circu-               rying the mutation Y453F21. The B.1.1.298 lineage also
synonymous nucleotide
substitutions do not change
                                      lating and affect immune recognition to a degree that            has Δ69–70, an amino-​terminal domain (NTD) dele-
the amino acid.                       requires immediate attention. The spike protein medi-            tion that has emerged several times across the global
                                      ates attachment of the virus to host cell-​surface recep-        SARS-​C oV-2 population, including in the second
Lineages                              tors and fusion between virus and cell membranes11               N439K lineage, B.1.258. Δ69–70 is predicted to alter the
Monophyletic clusters of
                                      (Box 1). It is also the principal target of neutralizing anti-   conformation of an exposed NTD loop and has been
viruses assigned on the basis
of the severe acute reparatory        bodies generated following infection by SARS-​CoV-2              reported to be associated with increased infectivity22.
syndrome coronavirus 2                (refs12,13), and is the SARS-​CoV-2 component of both                 Genomic analyses indicate a change in host environ­
(SARS-​CoV-2) global                  mRNA and adenovirus-​based vaccines licensed for use             ment and signatures of increased selective pressures
phylogenetic tree.
                                      and others awaiting regulatory approval14. Consequently,         acting upon immunologically important SARS-​CoV-2
dN/dS ratio
                                      mutations that affect the antigenicity of the spike pro-         genes sampled from around November 2020 (ref.23). This
The ratio of non-​synonymous          tein are of particular importance. In this Review, we            coincided with the emergence of variants with higher
mutations per non-​synonymous         explore the literature on these mutations and their              numbers of mutations relative to previous circulating
site (dN) to synonymous               antigenic consequences, focusing on the spike protein            variants. These lineages because of their association
mutations per synonymous site
(dS), which is used to estimate
                                      and antibody-​mediated immunity, and discuss them in             with increased transmissibility were named ‘variants of
the balance between neutral           the context of observed mutation frequencies in global           concern’. They are defined by multiple convergent muta-
mutations, purifying selection        sequence datasets.                                               tions that are hypothesized to have arisen either in the
and positive selection acting                                                                          context of chronic infections or in previously infected
on gene or a specific codon.
                                      Spike mutations receiving early attention                        individuals24–29. In addition to understanding the trans-
Amino acid substitution
                                      The rate of evolution of SARS-​CoV-2 from December               missibility and pathogenicity of these emerging variants,
A change in a specific amino          2019 to October 2020 was consistent with the virus               it is crucially important to characterize their antigenicity
acid of a protein. This is caused     acquiring approximately two mutations per month in               and the level of cross-​protection provided by infection by
by non-​synonymous mutations.         the global population15,16. Although our understanding           earlier viruses that are genetically and antigenically sim-
By convention, an amino acid
substitution is written in the
                                      of the functional consequences of spike mutations is             ilar to the virus that first emerged in December 2019 and
form N501Y to denote                  rapidly expanding, much of this knowledge involves the           which is used in all of the current vaccine preparations.
the wild-​type amino acid             reactive investigation of amino acid changes identified as       Information on how spike mutations affect antigenic
(N (asparagine)) and the              rapidly increasing in frequency or being associated with         profiles can be derived from structural studies, mutations
substituted amino acid
                                      unusual epidemiological characteristics. Following the           identified in viruses exposed to mAbs or plasma con-
(Y (tyrosine)) at site 501 in
the amino acid sequence.
                                      emergence of D614G, an amino acid substitution within            taining polyclonal antibodies, targeted investigations of
                                      the receptor-​binding motif (RBM), N439K, was noted              variants using site-​directed mutagenesis and deep muta-
Monoclonal antibodies                 as increasing in frequency in Scotland in March 2020.            tional scanning (DMS) experiments that systematically
(mAbs). Antibodies made               Whereas this first lineage with N439K (designated                investigate the possibility of mutations arising.
by cloning a unique white
blood cell, which usually has
                                      B.1.141 with the Pango nomenclature system17) quickly
monovalent binding affinity for       became extinct, another lineage that independently               Immunogenic regions of spike
a specific epitope. Virus particles   acquired N439K (B.1.258) emerged and circulated                  Several studies have probed the antigenicity of
can be saturated with mAbs,           widely in many European countries18. N439K is note-              the SARS-​C oV-2 spike protein by epitope mapping
and the structure can be solved
                                      worthy as it enhances the binding affinity for the ACE2          approaches, including solving the structure of the spike
to determine the antibody
footprint or mAbs can be used
                                      receptor and reduces the neutralizing activity of some           protein in complex with the antigen-​binding fragment
to select for mutations that          monoclonal antibodies (mAbs) and polyclonal antibod-             of particular antibodies13,30–32. Serological analyses of
escape recognition.                   ies present in sera from people who have recovered               almost 650 individuals infected with SARS-​C oV-2
                                                                                                       indicated that ~90% of the plasma or serum neutraliz-
                                                                                                       ing antibody activity targets the spike receptor-​binding
 Author addresses
                                                                                                       domain (RBD)12. A relative lack of glycan shielding may
 1
  institute of Biodiversity, animal Health and Comparative Medicine, College of Medical,               contribute to the immunodominance of the RBD33. One
 veterinary and Life sciences, university of Glasgow, Glasgow, uK.                                     study reported structural, biophysical and bioinformat-
 2
  MrC–university of Glasgow Centre for virus research, Glasgow, uK.                                    ics analyses of 15 SARS-​CoV-2 RBD-​binding neutral-
 3
  Department of Medicine, university of Cambridge, Cambridge, uK.                                      izing antibodies31. Antibody footprints were generated
 4
  institute of evolutionary Biology, university of edinburgh, edinburgh, uK.                           by structural analyses of the spike residues considering
 5
  Cambridge institute of therapeutic immunology and infectious Disease, university of
                                                                                                       potential hydrogen bonds and van der Waals interactions
 Cambridge, Cambridge, uK.
 6
  Department of Clinical research, London school of Hygiene and tropical Medicine,                     with a mAb atom that were less than 4.0 Å. Structural
 London, uK.                                                                                           analyses allowed the categorization of RBD-​binding
 7
  wellcome sanger institute, Hinxton, uK.                                                              neutra­lizing antibodies into four classes (Fig. 1a,b): ACE2-​
 8
  these authors contributed equally: william t. Harvey, alessandro M. Carabelli.                       blocking antibodies that bind the spike protein in the
 *a list of members and their affiliations appears in supplementary information.                       open conformation (class 1); ACE2-​blocking antibodies

                                                                                                                                         www.nature.com/nrmicro

                                                                          0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

 Box 1 | Spike protein structure and function                                                       mAbs39 or convalescent plasma containing polyclonal
                                                                                                    antibodies40,41; targeted characterization of particular
 as with other coronaviruses, the entry of severe acute respiratory syndrome                        mutations18,42; and wider investigations of either large
 coronavirus 2 (sars-​Cov-2) into host cells is mediated by the transmembrane spike                 numbers of circulating variants43 or all possible amino
 glycoprotein, which forms homotrimers on the surface of the virion. the sars-​Cov-2
                                                                                                    acid substitutions in the RBD39,44–46. For spike residues
 spike protein is highly glycosylated, with 66 potential N-​glycosylation sites per
                                                                                                    where mutations have been shown to influence poly-
 trimer98,99 (Fig. 4a).the sars-​Cov-2 spike protein is post-​translationally cleaved
 by mammalian furin into two subunits: s1 and s2 (Fig. 4a). the s1 subunit largely                  clonal antibody recognition, the observation of an effect
 consists of the amino-​terminal domain and the receptor-​binding domain (rBD), and                 on either mAbs or plasma is indicated in Fig. 1b. For a
 is responsible for binding to the host cell-​surface receptor, aCe2, whereas the s2                smaller number of residues, escape mutations emerg-
 subunit includes the trimeric core of the protein and is responsible for membrane fusion           ing in virus exposed to mAbs or polyclonal plasma have
 (Fig. 4b). the presence of a polybasic furin cleavage site at the s1–s2 boundary, which            been described (‘mAb emerge’ and ‘plasma emerge’ in
 is unique within the subgenus Sarbecovirus, is important for infectivity and virulence100,         Fig. 1b).
 with furin cleavage facilitating the conformational change required for receptor                       In a DMS study, researchers assessed all possible
 binding50. the spike protein transiently undergoes conformational changes between a                single amino acid variants using a yeast-​display system
 closed conformation and an open conformation in which a hinge-​like movement raises                and detected variants that escape either nine neutraliz-
 the rBD50. the residues comprising the receptor-​binding motif are revealed on the
                                                                                                    ing SARS-​CoV-2 mAbs45 or convalescent plasma from
 upright rBD, enabling binding to aCe2, which induces a progressively more open
 structure until a fully open, three-​aCe2-​bound structure is formed, initiating s2                11 individuals taken at two time points after infection39
 unsheathing and membrane fusion101.                                                                (shades of green in Fig. 1b). The resulting heat maps
                                                                                                    provide rich data on the antigenic consequence of RBD
                                                                                                    mutations, with the plasma escape mutations being of
                                     that bind the RBD in both the open conformation and            particular interest given that they impact neutraliza-
                                     the closed conformation (class 2); antibodies that do not      tion by polyclonal antibodies of the kind SARS-​CoV-2
                                     block ACE2 and bind the RBD in both the open con-              encounters in infections, with significant levels of immu-
Epitope mapping
Experimental determination
                                     formation and the closed conformation (class 3); and           nity acquired through prior exposure or vaccination.
of the binding site, or epitope,     neutralizing antibodies that bind outside the ACE2 site        Although significant interperson and intraperson hetero­
of an antibody. Approaches           and only in the open conformation (class 4)31. Within          geneity in the impact of mutations on neutralization by
include X-​ray co-​crystallography   the RBD, RBM epitopes overlapping the ACE2 site are            polyclonal serum has been described, the mutations that
or cryogenic electron
                                     immunodominant, whereas other RBD sites generate               reduce antibody binding the most occur at a relatively
microscopy of an antigen–
antibody complex and the             lower and variable responses in different individuals12.       small number of RBD residues, indicating substantial
mapping of systematic                    Although the RBD is immunodominant, there is               immunodominance within the RBD39.
mutations introduced by              evidence for a substantial role of other spike regions in          Of all RBD residues for which substitutions affected
site-​directed mutagenesis.          antigenicity, most notably the NTD13,30,34. Early struc-       recognition by convalescent sera, DMS identified E484
Glycan shielding
                                     tural characterization of NTD-​specific antibodies 4A8         as being of principal importance, with amino acid
The process by which a virus         (ref.32) and 4–8 (ref.13) revealed similar epitope locations   changes to K, Q or P reducing neutralization titres by
can cloak underlying protein,        towards the upper side of the most prominently protrud-        more than an order of magnitude39. E484K has also been
impeding antibody binding.           ing area of the NTD. Cryogenic electron microscopy was         identified as an escape mutation that emerges during
This is mediated by glycans,
                                     used to determine the antibody footprint of the neutral-       exposure to mAbs C121 and C144 (ref.40) and convales-
bulky sugar molecules that
are covalently attached to           izing antibody 4A8, and showed key interactions involv-        cent plasma41, and was the only mutation described in
amino acid side chains of            ing spike residues Y145, H146, K147, K150, W152, R246          one study as able to reduce the neutralizing ability of a
the viral protein.                   and W258 (ref.32). Epitope binning of 41 NTD-​specific         combination of mAbs (REGN10989 and REGN10934)
                                     mAbs led to the identification of six antigenic sites,         to an unmeasurable level47. In an escape mutation study
Immunodominance
The phenomenon by
                                     one of which is recognized by all known NTD-​specific          using 19 mAbs, substitutions at E484 emerged more
which the host immune                neutralizing antibodies and has been termed the ‘NTD           frequently than at any other residue (in response to
response against a viral             supersite’, consisting of residues 14–20, 140–158 and          four mAbs), and each of the four 484 mutants identi-
particle is mostly focused on        245–264 (ref.30) (Fig. 1a,b). The mechanism of neutraliza-     fied (E484A, E484D, E484G and E484K) subsequently
a few antigens and mediated
                                     tion by which NTD-​specific antibodies act remains to be       conferred resistance to each of four convalescent sera
by potently neutralizing
antibodies.                          fully determined, although it may involve the inhibition       tested48. No other mAb-​selected escape mutants escaped
                                     of conformational changes or proposed interactions with        each of the four sera, although the mutations K444E,
Antibody footprints                  auxiliary receptors such as DC-​SIGN or L-​SIGN32,35.          G446V, L452R and F490S escaped three of the four sera
Amino acid residues of a             Relatively little is known of antigenicity in the S2 sub-      tested48.
3D folded protein that are
targeted and contacted by
                                     unit, with immunogenicity thought to be impeded by                 Mutations at position 477 of the spike protein
a binding antibody.                  extensive glycan shielding36, and although both linear         (S477G, S477N and S477R) rank prominently among
                                     and cross-​reactive conformational S2 epitopes have been       mAb escape mutations identified by one study, and the
Glycoprotein                         described37,38, the biological significance of these is not    mutation S477G conferred resistance to two of the four
A protein with oligosaccharide
                                     yet known.                                                     sera tested48. However, substitutions at 477 were not
chains (glycans) covalently
attached to amino acid                                                                              identified as being important in DMS with convales-
side chains. Virus surface           Spike RBD mutations and immune escape                          cent plasma39. The mutation N439K increases affinity
glycoproteins embedded in the        Several studies have contributed to the current under-         for ACE2 (ref.19), is predicted to result in an additional
membrane often have a role           standing of how mutations in the SARS-​CoV-2 spike             salt bridge at the RBM–ACE2 interface and is thought
in interactions with host cells,
including receptor binding,
                                     protein affect neutralization. These studies include           to preferentially reduce the neutralization potential
and are also commonly                traditional escape mutation work that identifies muta-         of plasma that already has low neutralizing activity18.
targeted by host antibodies.         tions that emerge in virus populations exposed to either       However, a DMS study39 did not find that the mutation

Nature Reviews | Microbiology

                                                                     0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

a                                                          b 140                                                  x                         xx                                                                                           140
                                                               148                                                                                                                                                                       148
                                                               150                                                                                                                                                                       150
                                                               151                                                                                                                                                                       151
                                                               345                                                                                                                                                                       345
                                                               346                                                                                                                                                                       346
                                                               352                                                                                                                                                                       352
                                                               378                                                                                                                                                                       378
                                                               406                                                                                                                                                                       406   Antibody class
                                                               408                                                                                                                                                                       408      NTD
                                                               417                                                                                                                                                                       417      RBD 1
                                                               439                                                                                                                                                                       439      RBD 2
                                                               441                                                                                                                                                                       441      RBD 3
                                                               443                                                                                                                                                                       443      RBD 4
                                                               444                                                                                                                                                                       444
                                                               445                                                                                                                                                                       445   RBS distance (Å)
                                                               446                                                                                                                                                                       446      150
                                                               447                                                                                                                                                                       447      100
                                                               448                                                                                                                                                                       448      50
                                                               449                                                                                                                                                                       449      0
                                                               450                                                                                                                                                                       450
                                                               452                                                                                                                                                                       452   Described
                                                               453                                                                                                                                                                       453      No
                                                               455                                                                                                                                                                       455      Yes
                                   180º
                                                               456                                                                                                                                                                       456   x  Yes (deletion)
                                                               458                                                                                                                                                                       458
                                                               472                                                                                                                                                                       472   Plasma escape
                                                                                                                                                                                                                                               fraction (DMS)
                                                               473                                                                                                                                                                       473
                                                                                                                                                                                                                                                   1
                                                               474                                                                                                                                                                       474
                                                                                                                                                                                                                                                   0.8
                                                               476                                                                                                                                                                       476
                                                                                                                                                                                                                                                   0.6
                                                               477                                                                                                                                                                       477
                                                                                                                                                                                                                                                   0.4
                                                               478                                                                                                                                                                       478
                                                               479                                                                                                                                                                       479       0.2
                                                               483                                                                                                                                                                       483       0
                                                               484                                                                                                                                                                       484
                                                                                                                                                                                                                                               ACE2 binding (DMS)
                                                               485                                                                                                                                                                       485
                                                                                                                                                                                                                                                   1
                                                               486                                                                                                                                                                       486
                                                                                                                                                                                                                                                   0
                                                               487                                                                                                                                                                       487
                                                                                                                                                                                                                                                  –1
                                                               489                                                                                                                                                                       489
                                                               490                                                                                                                                                                       490      –2
                                                               493                                                                                                                                                                       493      –3
                                                               494                                                                                                                                                                       494
                                                               496                                                                                                                                                                       496
                                                               499                                                                                                                                                                       499
                                                               501                                                                                                                                                                       501
                                                               503                                                                                                                                                                       503
                                                               504                                                                                                                                                                       504
                                                               519                                                                                                                                                                       519
                                                               Amino acid position

                                                                                              RBS distance (Å)

                                                                                                                 mAb effect
                                                                                                                             mAb emerge
                                                                                                                                          Plasma effect
                                                                                                                                                         Plasma emerge

                                                                                                                                                                         Plasma average

                                                                                                                                                                                          Plasma max

                                                                                                                                                                                                         Binding average

                                                                                                                                                                                                                           Binding max

                                                                                                                                                                                                       DMS

                                     N439K significantly alters neutralization by polyclonal                                                             Spike NTD mutations and immune escape
                                     antibodies in plasma, in contrast to previous studies                                                               In the NTD, most of the evidence for immune evasion
                                     that found that N439K reduced neutralization by mAbs                                                                focuses on a region centred at a conformational epitope
                                     and convalescent plasma18. One explanation for this                                                                 consisting of residues 140–156 (N3 loop) and 246–260
                                     inconsistency is that the mechanism of immune escape                                                                (N5 loop), which includes the epitope of the antibody
                                     conferred by N439K is through increased ACE2 affinity                                                               4A832 (Fig. 1, magenta). In studies that identified the
Epitopes                             rather than by directly affecting antibody epitope recog-                                                           emergence of antibody escape mutations in virus popu-
The specific parts of an antigen
recognized by the immune
                                     nition and that perhaps the experimental design of the                                                              lations exposed to convalescent plasma, mutations were
system: antibodies, B cells          DMS study is less sensitive to detecting immune evasion                                                             roughly evenly distributed between the RBD and the NTD
or T cells.                          mutations of this type.                                                                                             (Fig. 1b). One study described the emergence of escape

                                                                                                                                                                                                                                         www.nature.com/nrmicro

                                                                                     0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

◀   Fig. 1 | Neutralizing antibody classes defined by structural analyses and properties             change across the NTD and the RBD can drastically evade
    of spike protein residues. a | Amino acid residues of the severe acute respiratory               the polyclonal antibody response. The Δ140+E484K
    syndrome coronavirus 2 (SARS-​CoV-2) spike protein are coloured according to the                 double mutant next acquired an 11-​residue insertion in
    class of the antibody that binds to an epitope. Receptor-​binding domain (RBD) antibody          the NTD N5 loop between Y248 and L249, completely
    classes 1–4 (ref.31) are shown: green for class 1 (ACE2-​blocking antibodies that bind the
                                                                                                     abolishing neutralization. This insertion, which also
    spike protein in the open conformation), yellow for class 2 (ACE2-​blocking antibodies
    that bind the RBD in both the open conformation and the closed conformations),
                                                                                                     introduced a new glycosylation motif in the vicinity of
    blue for class 3 (antibodies that do not block ACE2 and bind the RBD in both the open            RDR4, is predicted to alter the structure of the antigenic
    conformation and the closed conformations) and red for class 4 (neutralizing antibodies          N3 and N5 NTD loops41. This finding further demon-
    that bind outside the ACE2 site and only in the open conformation). When residues                strates the structural plasticity of the NTD and indicates
    belong to epitopes of multiple classes, priority colouring is given to antibodies that block     that insertions and the acquisition of additional glyco-
    ACE2 and bind the closed spike protein. The amino-​terminal domain (NTD) supersite30             sylation motifs in the NTD are further mechanisms in
    is coloured in magenta. b | Aligned heat maps showing properties of amino acid residues          addition to deletion that lead to immune evasion. Other
    where substitutions affect binding by antibodies in polyclonal human blood plasma or             examples of mutations that impact the epitope–paratope
    emerge as antibody escape mutations. The distance in angstroms to the ACE2-​contacting           interface indirectly include mutations in the signal pep-
    residues that form the receptor-​binding site (RBS) is shown in shades of orange; each
                                                                                                     tide region and at cysteine residues 15 and 136, which
    residue is classified as having evidence for mutations affecting neutralization by either
    monoclonal antibodies (mAbs)40,43,47,48 or polyclonal antibodies in plasma from previously       form a disulfide bond that ‘staples’ the NTD amino termi-
    infected individuals (convalescent)39–41,43,48 or vaccinated individuals59 (‘mAb effect’ and     nus against the galectin-​like β-​sandwich30. Mutations at
    ‘plasma effect’, respectively). A subset of these residues has mutations described as            those sites (for example, C136Y and S12P, which alter the
    emerging upon exposure (co-​incubation) to mAbs40,47,48 or plasma40,41 in laboratory             cleavage occurring between residues C15 and V16) have
    experiments (‘mAb emerge’ and ‘plasma emerge’, respectively). When an observation                been shown to affect the neutralizing activity of several
    includes a deletion, this is indicated by a red cross. Shades of green depict the results of     mAbs, likely disrupting the disulfide bond and therefore
    deep mutational scanning (DMS) experiments where yeast cells expressing RBD mutants              dislodging the supersite targeted by several antibodies30.
    are incubated with multiple samples of human convalescent plasma39. The escape                       Across the spike protein, some mutations that con-
    fraction (that is, a quantitative measure of the extent to which a mutation reduced              fer escape to neutralizing mAbs have little impact on
    polyclonal antibody binding) averaged across all amino acid substitutions at a residue
                                                                                                     serum antibody binding39,40,44, possibly because those
    (‘plasma average’) and the maximally resistant substitution (‘plasma max’) are indicated.
    DMS data on ACE2-​binding affinity19 are shown in shades of red or blue representing
                                                                                                     mAbs are rare in polyclonal sera, targeting subdominant
    higher or lower ACE2 affinity, respectively. The mean change in binding affinity averaged        epitopes12,39,44. Escape mutations emerging in viruses
    across all mutations at each site (‘binding average’) and alternatively the maximally            exposed to convalescent plasma have been identified in
    binding mutant (‘binding max’) is shown. Scores represent binding constants (Δlog10 KD)          both the NTD (ΔF140, N148S, K150R, K150E, K150T,
    relative to the wild-​type reference amino acid.                                                 K150Q and S151P) and the RBD (K444R, K444N,
                                                                                                     K444Q, V445E and E484K)40,41 (Fig. 1b). Notably, muta-
                                                                                                     tions emerging under selective pressure from convales-
                                       mutations in viruses exposed to convalescent plasma           cent plasma may be different from those selected by the
                                       from two individuals, one of which selected for NTD           most frequent mAb isolated from the same plasma40.
                                       mutations only (N148S, K150R, K150E, K150T, K150Q             Potentially, observed differences arise because muta-
                                       and S151P)40. This was despite the plasma being a source      tions selected by convalescent plasma facilitate escape
                                       of the highly potent RBD-​targeting mAb C144 (ref.40).        from multiple mAbs. Fewer data on the antigenic
                                       NTD antibody escape mutations were not observed for           effects of S2 mutations exist, though D769H has been
                                       the other samples of plasma investigated, and further-        described as conferring decreased susceptibility to neu-
                                       more, the 148–151 mutants exhibited only marginal             tralizing antibodies24. Residue 769 is positioned in a
                                       reductions in sensitivity to the plasma tested, indicat-      surface-​exposed S2 loop, and D769H was found to arise,
                                       ing individual immune responses may be differentially         in linkage with Δ69–70, in an immunocompromised
                                       affected by mutations of RBD and NTD epitopes40.              individual treated with convalescent plasma24.
                                           Deletions in the NTD have been observed repeatedly
                                       in the evolution of SARS-​CoV-2 and have been described       Conformational epitopes in spike
                                       as changing NTD antigenicity30,41,42. One study identified    To evaluate potential antigenicity across the spike pro-
                                       four recurrently deleted regions (RDRs) within the NTD        tein, we analysed the protein using BEpro, a program
    Epitope binning                    and tested five frequently observed deletions within these:   for the prediction of conformational epitopes based
    An approach that uses a            Δ69–70 (RDR1), Δ141–144 and Δ146 (RDR2), Δ210                 on tertiary structure 49. This approach calculates a
    competitive immunoassay to         (RDR3) and Δ243–244 (RDR4)42. Of the four RDRs,               structure-​based epitope score, which approximates anti-
    sort a library of monoclonal
    antibodies into discrete groups
                                       RDR1, RDR2 and RDR4 correspond to NTD loops N2,               body accessibility for each amino acid position. For each
    of antibodies that compete for     N3 and N5, whereas RDR3 falls between N4 and N5 in            residue, the calculated score accounts for the local pro-
    access to overlapping epitopes.    another accessible loop (Fig. 2a, asterisk). Both RDR2        tein structure: half-​sphere exposure measures and pro-
                                       deletions, Δ141–144 and Δ146, and Δ243–244 (RDR4)             pensity scores each depend on all atoms within 8–16 Å of
    Convalescent plasma
                                       abolished binding of 4A8 (ref.42). Further evidence of        the target residue, with weighting towards closer atoms.
    Blood serum of a previously
    infected individual that usually   the role of RDR2 deletions in immune escape was pro-          Due to this aggregation, calculated scores are relatively
    contains a mixture of different    vided by a study that describes the emergence of Δ140         insensitive to the effects of single amino acid substitu-
    antibodies referred to as          in SARS-​CoV-2 co-​incubated with potently neutralizing       tions. Scores were calculated for the spike protein in both
    polyclonal antibodies.             convalescent plasma, causing a fourfold reduction in neu-     the closed conformation and the open conformation
    Similarly, postvaccination
    serum includes polyclonal
                                       tralization titre41. This Δ140 spike mutant subsequently      (Fig. 2). It has been estimated that ~34% of spike pro-
    antibodies generated by            acquired the E484K mutation, resulting in a further four-     teins are closed and 27% are open (with the remainder
    vaccination.                       fold drop in neutralization titre, and thus a two-​residue    in an intermediate form) following furin cleavage50.

    Nature Reviews | Microbiology

                                                                      0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

                                                                               Scores rescaled between 0 and 1 are plotted for the closed                Fig, 1b). Notably, scores for residues with mutations
                                                                               conformation in Fig. 2a and are represented on the struc-                 described as affecting plasma antibody recognition are
                                                                               ture in Fig. 2b. A limitation of this approach is that it does            also slightly higher on average compared with those with
                                                                               not account for glycan shielding of residues and likely                   mutations described as affecting mAbs only. Epitope
                                                                               overestimates scores at the base of the ectodomain for                    scores are particularly high for residues with mutations
                                                                               residues closest to the carboxy terminus.                                 described as emerging during exposure to convalescent
                                                                                  Comparisons with reporting of antibody footprints                      plasma40,41 (Supplementary Fig. 1b). Experimental data
                                                                               and the impact of mutations on antigenicity indicate                      on the emergence of mutations under selective pressure
                                                                               that residues with mutations described as affecting                       from polyclonal antibodies are relatively rare, although
                                                                               recognition by mAbs or antibodies in convalescent                         these trends for higher scores associated with such muta-
                                                                               plasma (Fig. 1b) tend to occur at residues with higher                    tions indicate that information from structural analysis
                                                                               structure-​based antibody accessibility scores compared                   approaches of this kind can contribute to the ranking of
                                                                               with other residues belonging to epitope footprints and                   residues at which substitutions are likely to impact the
                                                                               residues not implicated in antigenicity (Supplementary                    polyclonal antibody response.

  a                                                        NTD                    RBD                                                                        b
                                          1.0               N3
                                                      N2                                                                                                                                             Closed
                                                 N1          N4                                                                                                                                      spike
                                                                  * N5
                                          0.8                                          477
                                                                                             501                                               Accessibility
Antibody accessibility score

                                                                                                                                               score
                                                       70                        439                                                              1.0
                                          0.6                                                                                                     0.8
                                            69                                                                                                    0.6
                                                                                       453                                                        0.4
                                          0.4                                                                                                     0.2
                                                                 222
                                                                                                                                                  0.0

                                          0.2

                                                                                                     614
                                          0.0
                                                                                                                                                         d       ACE2-binding site
                                                 0    100        200     300     400    500        600     700   800   900   1,000 1,100
                                                                                   Amino acid position                                                       Upright
                                                                                                                                                             RBD
  c
                                                                                                                                           More
                                           0.4                                                                                             accessible                                                Open
                                                                                                                                           when open                                                 spike
                                           0.2
                                           0.0
   Adjustment to antibody accessibility

                                          −0.2
                                                                                                                                           More
                                                 0    100    200         300    400     500        600     700   800   900   1,000 1,100 accessible
                                                                                                                                           when closed
                                           0.4
                                           0.2
                                           0.0
                                          −0.2
                                                 0    100    200         300    400     500        600     700   800   900   1,000 1,100

                                           0.4
                                           0.2
                                           0.0
                                          −0.2
                                                 0    100        200     300    400     500        600     700   800   900   1,000 1,100

   Fig. 2 | Structure-based analysis of conformational epitopes on the spike                                                          view along this axis (right). c | The extent to which each spike residue becomes
   protein. a | Structure-​based antibody accessibility scores for each spike                                                         more or less accessible when the spike protein is in its open form is shown. For
   protein ectodomain residue in the closed form were calculated with BEpro49.                                                        each spike monomer (upright receptor-​binding domain (RBD) (yellow), closed
   Black diamonds at the top and bottom of the plot indicate the positions of                                                         RBD clockwise adjacent (green) and closed RBD anticlockwise adjacent
   ACE2-​contacting residues. Accessible amino-​terminal domain (NTD) loops                                                           (blue)), the difference relative to the score calculated for the closed form
   N1–N5 are labelled, and a loop falling between these is indicated with an                                                          (shown in part a) is shown. d | Two surface colour representations of antibody
   asterisk. b | Two surface colour representations of antibody accessibility scores                                                  accessibility scores for the spike protein in the open conformation with a
   for the spike protein in the closed conformation according to the colour                                                           single monomer with an upright RBD are shown: a trimer axis vertical view
   scheme in part a: a trimer axis vertical view (left) and an orthogonal top-​down                                                   (left) and an orthogonal top-​down view along this axis (right).

                                                                                                                                                                                          www.nature.com/nrmicro

                                                                                                                        0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

                           Within the RBD, the two areas with high structure-​         frequency of mutations in circulating variants (Fig. 3).
                       based antibody accessibility scores for the closed spike        Globally, the highest number of amino acid variants,
                       structure (Fig. 2a, peaks with consecutive residues with        mapped against the Wuhan-​Hu-1 reference sequence
                       scores greater than 0.8) are centred at residues 444–447        (MN908947), are recorded at amino acid positions
                       and residues 498–500. These areas are represented as            614, 222 and 18 (Fig. 4a) (among 426,623 high-​quality
                       yellow patches near the centre of the top-​down view of         sequences retrieved from the GISAID database on
                       the spike structure in Fig. 2b. Figure 2c shows that, in        3 February 2021 and processed using CoV-​GLUE).
                       general, residues become more accessible and are likelier       Residues at positions 614 and 222 have relatively
                       to form epitopes when the spike protein is in the open          low antibody access scores and are positioned ~50 Å
                       conformation, and this is especially true for the RBD,          from the RBS residues when the spike protein is in
                       particularly for the upright RBD (Fig. 2c, yellow). In the      the open conformation (Fig. 3a,b). As mentioned ear-
                       open form, residues close to the ACE2-​binding site (405,       lier, there is evidence indicating that D614G confers
                       415, 416, 417 and 468) become much more exposed on              a moderate advantage for infectivity8,9 and increases
                       both the upright RBD and the clockwise adjacent closed          transmissibility10. The spike amino acid substitution
                       RBD (Fig. 2c, green). The effect of mutations at these          with the second highest frequency is A222V, which is
                       positions is likely to be greater for antibodies belonging      present in the 20A.EU1 SARS-​CoV-2 cluster (also des-
                       to RBD class 1. Residues centred at 444–447 and 498–500         ignated lineage B.1.177). This lineage has spread widely
                       maintain high scores on the upright RBD and are                 in Europe and is reported to have originated in Spain52.
                       joined by residues in areas 413–417 and 458–465. The            There is no evidence for a notable impact of A222V on
                       only RBD residues that become notably less accessible           virus phenotype (that is, infectivity and transmissibility),
                       in the open spike structure are residues 476, 477, 478,         and so its increase in frequency is generally presumed to
                       586 and 487 of the closed RBD clockwise adjacent to             have been fortuitous rather than a selective advantage.
                       the upright RBD, which become blocked by the upright            The substitution L18F has occurred ~21 times in the
                       RBD (Fig. 2c, green). Several RBD-​specific antibodies are      global population53 and is associated with escape from
                       able to bind only the open spike protein (RBD classes 1         multiple NTD-​binding mAbs30.
                       and 4 (ref.31)), and interestingly, it has been observed that       Among the 5,106 independent substitutions observed
                       D614G makes the spike protein more vulnerable to neu-           in the spike protein (Box 1), 161 are described as affecting
                       tralizing antibodies by increasing the tendency for the         recognition by mAbs or polyclonal antibodies in sera, of
                       open conformation to occur51.                                   which 22 are present in more than 100 sequences. On
                           Within the NTD, the highest-​scoring spike residues in      average, variant frequency is higher at amino acid posi-
                       the closed form belong to a loop centred at residues            tions where mutations are described as affecting anti-
                       147–150, which each have scores greater than 0.9 (Fig. 2a,      body recognition than at positions with no described
                       yellow patch to the extreme right of the structure viewed       substitutions of antigenic importance (Supplementary
                       from the side in Fig. 2b). This loop, known as the N3           Fig. 1a), and high levels of amino acid substitutions are
                       loop, is described as forming key interactions with the         observed at some amino acid positions where mutations
                       neutralizing antibody 4A8 (ref.32). One study described         are described as affecting recognition by antibodies in
                       the structure of five previously unmodelled, protruding         convalescent plasma, including positions 439 and 484.
                       NTD loops, denoting them N1–N5. In addition to N3,              This indicates that, generally, the amino acid positions
                       high-​scoring residues (greater than 0.7) are found at          at which antibody escape mutations have been detected
                       positions 22–26 (N1), 70 (N2), 173–187 (N4), 207–213            in vitro tolerate mutations at least to some degree in vivo.
                       (Fig. 2a, asterisk) and 247–253 (N5). Structural analysis           Within the RBD, the positions at which amino acid
                       indicates NTD-​binding antibodies are likely able to bind       substitutions are present at the highest frequency are
                       epitopes when the spike protein is in either the closed         located close to the RBD–ACE2 interface (Fig. 3). Of the
                       conformation or open the conformation (Fig. 2c). Outside        three RBD amino acid substitutions present in several
                       the NTD and the RBD, the highest-​scoring residues are          thousand sequences, N439K and N501Y were described
                       residues 676 and 689 (which lie on either side of the           earlier, and N501Y is discussed in more detail in the next
                       loop containing the S1–S2 furin cleavage site, which            section in the context of variants of concern. The other
                       is disordered in both the open conformation and the             substitution, S477N, is estimated to have emerged at
                       closed conformation50), 793–794, 808–812, 1,099–1,100           least seven times in the global SARS-​CoV-2 population
                       and 1,139–1,146 (Fig. 2a). When the spike protein is in         and has persisted at a frequency of between 4% and 7%
                       the open conformation, increased accessibility results          of sequences globally since mid-​June 2020 (ref.53). One
                       in substantially higher potential epitope scores for S2         study described multiple mAbs that selected for the
                       residues centred at 850–854, which become more                  emergence of S477N and found this mutant to be resist-
                       accessible on all three spike monomers (Fig. 2c), and           ant to neutralization by the entire panel of RBD-​targeting
                       residues 978–984, which become more accessible on               mAbs that were tested. By contrast, when tested with
                       the monomer anticlockwise adjacent to the upright RBD           convalescent serum, neutralization of the S477N mutant
                       monomer (Fig. 2c, blue).                                        was similar to that of the wild type48. In common with
                                                                                       N439K and N501Y, S477N results in increased affinity
                       Structural context of spike mutations                           for the ACE2 receptor, although to a lesser extent19,54. As
                       To assess the impact of spike mutations and their immuno­       described in Box 2, substitutions may facilitate immune
                       logical role in the global SARS-​C oV-2 population,             escape by increasing receptor-​binding affinity inde-
                       we combined structural analyses with the observed               pendently of any effect that they may have on antibody

Nature Reviews | Microbiology

                                                       0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

a                                                                                                                                b
                                                                                                                                              477
Distance from receptor-binding site (Å)

                                                                                                                                      18

                                          100                                                                       Variant
                                                                                                                    frequency                                                 69–70
                                                                                                                         100,000
                                                                                                                                                                        222            144
                                                                                                                                      614
                                                                              222 69         70                          1,000
                                                              614
                                           50
                                                                                                                         10

                                                                                                                                                                                               439
                                                                                                                                                               501

                                                                                               477
                                            0
                                                                                453         417 501 446
                                                     0.2           0.4            0.6     484 0.8       1.0
                                                              Antibody accessibility score
                                                                                                                                            477
                                                                                                                         d
                                           c
                                                    ACE2

                                                  477                                                                                                             501
                                                  417                                             501
                                                  453                                             439                                                                                         439
                                                                                                                                                            453
                                                Upright RBD

                                                                                                                                                                        477

Fig. 3 | Structural context of spike amino acid mutations in the global                                               b | Spike protein in closed form with all residues coloured according to the
virus population. Spike amino acid residues are coloured according to the                                             frequency scale shown; a trimer axis vertical view (left) and an orthogonal
frequency of amino acid substitutions or deletions. Variants (retrieved from                                          top-​down view along this axis (right) are shown. c | A close-​up view of the
CoV-​GLUE) are based on 426,623 high-​quality sequences downloaded                                                    receptor-​binding domain (RBD) bound to ACE2 (RCSB Protein Data Bank ID
from the Global Initiative on Sharing All Influenza Data (GISAID) database                                            6M0J95), with RBD residues shown as spheres coloured by amino acid variant
on 3 February 2021. a | Points representing each spike amino acid residue                                             frequency and ACE2 shown in gold. Amino acid variants are present at
are positioned according to the antibody accessibility score and the                                                  high frequency in positions at the RBD–ACE2 interface. d | Spike protein
distance to the nearest residue in the receptor-​binding site. Residues with                                          in open form with residues where at least 100 sequences possessing a
at least 100 sequences possessing a substitution or deletion are coloured                                             substitution are highlighted; a trimer axis vertical view (left) and an
according to the frequency scale shown, with the remainder shaded grey.                                               orthogonal top-​down view along this axis (right) are shown.

                                                                    recognition of epitopes; therefore, it is possible that such      was detected in Denmark spreading among farmed mink
                                                                    a mechanism contributes to the impact of S477N on                 and a small number of people20. This lineage is character-
                                                                    neutralization. Variant frequency is also moderately high         ized by four amino acid differences, ΔH69–V70, Y453F,
                                                                    at RBD–ACE2 interface amino acid positions 417, 453               I692V and M1229I (Fig. 5). Of these, the Y453F substitu-
                                                                    and 446. Of these positions, 446 occurs in a location in          tion occurs at a residue within the ACE2 footprint and
                                                                    the spike structure that is predicted to be highly anti-          has been shown by DMS to increase ACE2 affinity19. In
                                                                    genic, and substitutions at this site are described as            addition, Y453F has been described as reducing neu-
                                                                    affecting neutralization by both mAbs and antibodies              tralization by mAbs47. In late 2020 and early 2021, the
                                                                    present in polyclonal serum39,43,46,48. Substitutions at          emergence and sustained transmission of lineages with
                                                                    amino acid positions 417 and 453 are described in the             mutations that affect the characteristics of the virus
                                                                    next section in the context of variants of concern.               received much attention, most notably lineages B.1.1.7,
                                                                                                                                      B.1.351 and P.1 (also known as 501Y.V1, 501Y.V2 and
                                                                    Variants of interest or concern                                   501Y.V3, respectively). The locations of the spike muta-
                                                                    In addition to single mutations of note, more heavily             tions in the B.1.1.298, B.1.1.7, B.1.351 and P.1 lineages
                                                                    mutated SARS-​CoV-2 lineages have emerged. Arguably               are annotated in Fig. 5a, and information on the struc-
                                                                    the first variant of interest defined by the presence of sev-     tural context and consequences of mutations for antibody
                                                                    eral spike mutations, and referred to as B.1.1.298 (cluster 5),   recognition and ACE2 binding are shown in Fig. 5b.

                                                                                                                                                                         www.nature.com/nrmicro

                                                                                                        0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

                                                       Of the lineages summarized in Fig. 5, several amino             in lineages B.1.1.298 and B.1.1.7. Additionally, lineages
                                                   acid substitutions are convergent, having arisen inde-              B.1.351 and P.1 possess alternative amino acid substi-
                                                   pendently in different lineages: N501Y, which is present            tutions K417N and K417T, respectively. Further line-
                                                   in lineages B.1.1.7, B.1.351 and P.1; E484K, which is pres-         ages with these mutations have also been identified; for
                                                   ent in lineages B.1.351 and P.1 and has been detected as            example, an independent emergence of N501Y in the
                                                   emerging within the B.1.1.7 lineage55; and ΔH69–V70                 B.1.1.70 lineage, which is largely circulating in Wales.

a                                                                     S1                                                           S2
                                                    NTD                    RBD     RBM                                       FP                         CD         TM CT
                                                                                                   614
            100,000                                            222
                                          69–70
                                        18          144                             501         570      681         716                      982
                                                                                  477                                                                  1,118
                    10,000                                                        439
Variant frequency

                     1,000

                      100

                       10

                        0

                                    0        100         200         300    400         500     600            700         800     900      1,000     1,100     1,200
                                                                                              Amino acid position

                        b                  RBM
                                                                             c

                                                   RBD
                             NTD

                                   S1                  S2

                                                          FP

                                                       CD

Fig. 4 | Spike protein sequence variability and structure. a | The domain                          particularly within the amino-​terminal domain (NTD). The most frequently
organization of the severe acute respiratory syndrome coronavirus 2                                detected NTD deletion is the two-​residue deletion at positions 69 and 70
(SARS-​CoV-2) spike protein showing amino acid sequence variability.                               (Δ69–70), present in 45,898 sequences. The S1–S2 boundary is at amino
The spike protein is synthesized as a 1,273 amino acid polypeptide, and the                        acid position 685. b | Spike protein monomer displaying an upright
frequency of amino acid variants, including both substitutions and                                 receptor-​binding domain (RBD). c | Spike protein structure in the closed
deletions, at each of the positions is shown. These variants, relative to the                      conformation overlaid with surface representations shown with a trimer
Wuhan-​Hu-1 reference sequence, were identified with use of CoV-​GLUE96,                           axis vertical view (left) and an orthogonal top-​down view along this axis
which filters out Global Initiative on Sharing All Influenza Data (GISAID)                         (right). Domains are coloured as in part a. The RCSB Protein Data Bank IDs
sequences97 identified as being of low quality or from non-​human hosts                            for the SARS-​CoV-2 spike protein structures are 6ZGG and 6ZGE50. The
(sequences retrieved from the GISAID database on 3 February 2021).                                 magenta spheres represent glycans, and the magenta triangles represent
Among 426,623 genomes after filtering, 5,106 different amino acid                                  potantial N-​linked glycosylation sites. The scissors represent the S1–S2
replacements or substitutions across 1,267 spike positions were identified,                        boundary at amino acid position 685. CD, connecting domain;
of which 320 at 259 positions were observed in at least 100 sequences.                             CT cytoplasmic tail; FP, fusion peptide; RBM, receptor-​binding motif;
In addition to substitutions, several deletions have been observed,                                TM, transmembrane domain.

Nature Reviews | Microbiology

                                                                                  0123456789();:
SARS- COV-2 VARIANTS, SPIKE MUTATIONS AND IMMUNE ESCAPE
Reviews

 Box 2 | Mechanisms of antigenic change                                                         antigenic effect, both K417N and K417T are expected
                                                                                                to moderately decrease ACE2-​binding affinity19 (Fig. 5b).
 in common with other virus surface glycoproteins responsible for attachment to                 The ΔH69–V70 deletion has been identified in variants
 host cell-​surface receptors, such as influenza virus haemagglutinin and the envelope          associated with immune escape in immunocompro-
 glycoprotein GP120 of Hiv, the severe acute respiratory syndrome coronavirus 2                 mised individuals treated with convalescent plasma24.
 (sars-​Cov-2) spike glycoprotein is an important target for neutralizing antibodies.
                                                                                                Experiments have shown that ΔH69–V70 does not
 there are various distinct mechanisms by which mutations can alter the antigenic
 properties of a glycoprotein.                                                                  reduce neutralization by a panel of convalescent sera;
                                                                                                however, it may compensate for infectivity deficits asso-
 Amino acid substitutions that alter the epitope                                                ciated with affinity-​boosting RBM mutations that may
 a change in the biophysical properties of an epitope residue directly diminishes
                                                                                                emerge due to immune-​mediated selection22.
 antibody binding. For example, the neutralizing antibody 4a8 forms salt bridges with
 spike protein residues K147 and K150, and therefore substitutions at these residues
                                                                                                    The first genomes belonging to the B.1.1.7 lineage
 are likely to inhibit binding. the e484K amino acid substitution has received attention        were sequenced in the south of England in September
 for its effect on monoclonal antibodies and convalescent plasma neutralizing activity.         2020. Lineage B.1.1.7 is defined by the presence of
 its position has been described as belonging to the footprint of several antibodies,           23 nucleotide mutations across the genome that map
 and a change in charge caused by replacement of a glutamate residue with a lysine              to a single branch of the phylogenetic tree3. Of these
 residue has the potential to diminish antibody binding.                                        23 mutations, 14 encode amino acid changes and three
 increasing receptor-​binding avidity                                                           are deletions, including six amino acid substitutions in
 substitutions that individually increase receptor-​binding affinity can shift the binding      the spike protein (N501Y, A570D, P681H, T716I, S982A
 equilibrium between glycoprotein and neutralizing antibodies in favour of a higher-            and D1118H) and two NTD deletions (ΔH69–V70 and
 ​ vidity interaction between glycoprotein and the cellular receptor102. the spike amino
 a                                                                                              ΔY144)3. The lineage has been associated with a rapidly
 acid substitution N501Y, which increases aCe2-​binding affinity19, has been described          increasing proportion of reported SARS-​CoV-2 cases,
 as emerging in individuals treated with convalescent plasma, potentially as a means            and phylogenetic analyses indicate that this lineage was
 of immune escape.                                                                              associated with a growth rate estimated to be 40–70%
 changes in glycosylation                                                                       higher than that of other lineages60,61. There is also evi-
 Glycans are bulky sugar molecules that may shield epitopes from antibody binding.              dence that this lineage may be associated with a higher
 N-​linked glycans are typically prominent in glycan shielding of virus surface glycoprotein    viral load62. In addition to N501Y, for which there is some
 epitopes33, although O-​linked glycans can also contribute103. a substitution can introduce    evidence that it reduces neutralization by a small propor-
 an additional N-​linked glycosylation motif. the acquisition of epitope-​masking glycans
                                                                                                tion of RBD antibodies63, there is evidence for an anti-
 during the evolution of human influenza viruses is well described104.
                                                                                                genic effect of ΔY144 (Fig. 5b). This deletion is expected to
 Deletions and insertions                                                                       alter the conformation of the N3 NTD loop (amino acid
 the deletion or insertion of residues has the potential to alter epitope conformation,         positions 140–156) and has been demonstrated to abol-
 diminishing antibody binding. several deletions in the spike amino-​terminal domain (NtD)      ish neutralization by a range of neutralizing antibodies30.
 that affect recognition by neutralizing antibodies have been described41,42. in laboratory
                                                                                                The B.1.1.7 spike mutations have been shown to dimin-
 experiments, a multiresidue insertion in the spike NtD has been described as emerging
 and contributing to escape from polyclonal antibodies in convalescent plasma41.                ish neutralization of a higher proportion of NTD-​specific
                                                                                                neutralizing antibodies (9 of 10; 90%) than RBD-​
 Allosteric structural effects                                                                  specific neutralizing antibodies (5 of 31; 16%) 63.
 similarly to deletions or insertions, an amino acid substitution outside an epitope
                                                                                                Given the immunodominance of the RBD, this could
 footprint may affect antibody binding by changing the protein conformation in such
 a way that an epitope is altered or differently displayed. in the spike NtD, changes
                                                                                                explain the modest reductions in neutralizing activity
 to disulfide bonds are thought to reduce binding by multiple monoclonal antibodies             of convalescent sera against authentic B.1.1.7 or pseudo­
 through this mechanism30.                                                                      viruses carrying the B.1.1.7 spike mutations64,65. The
                                                                                                co-​occurrence of ΔY144 and E484K is concerning with
                                                                                                respect to the polyclonal antibody response as the N3
                                 Residue 501 is at the RBD–ACE2 interface (Fig. 2c), and        loop, which ΔY144 changes, is predicted to be among the
                                 N501Y has been shown experimentally to result in one           most immunogenic regions of the spike protein (Fig. 2a),
                                 of the highest increases in ACE2 affinity conferred by a       and amino acid substitutions at position 484 diminish
                                 single RBD mutation19. E484 has been identified as an          neutralization by a range of RBD-​targeting antibodies.
                                 immunodominant spike protein residue, with various                 Reports of lineages with N501Y circulating in the UK
                                 substitutions, including E484K, facilitating escape from       were followed by reports of another lineage possessing
                                 several mAbs40,47,48 as well as antibodies in convales-        N501Y circulating in South Africa (lineage B.1.351),
                                 cent plasma39–41,48. E484K is estimated to have emerged        which has been rapidly expanding in frequency since
                                 repeatedly in the global SARS-​C oV-2 population53             December 2020 (ref.66). In addition to N501Y, lineage
                                 and has been described in two other lineages origi-            B.1.351 is defined by the presence of five further spike
                                 nating from lineage B.1.1.28 in addition to lineage P.1        amino acid substitutions (D80A, D215G, K417N, E484K
                                 reported to be spreading in the state of Rio de Janeiro        and A701V) and a deletion in the NTD, Δ242–244. High
                                 in Brazil (lineage P.2)56 and in the Philippines (line-        numbers of B.1.351 viruses also have the spike amino
Avidity                          age P.3)57. Whereas K417 is described in the epitopes          acid substitutions L18F, R246I and D614G. A similar
Also referred to as functional   of RBD class 1 and class 2 antibodies31, alterations to        NTD deletion, Δ243–244, abolishes binding by the anti-
affinity, the accumulated        K417 tend to affect class 1 antibody binding and are           body 4A8 (ref.42), and L18F and R246I also occur within
binding strength of multiple     therefore somewhat less important for the polyclonal           the NTD supersite and likely affect antibody binding as
affinities of individual
interactions, such as between
                                 antibody response to the RBD, which is dominated by            well30. The co-​occurrence of K417N, E484K and these
a virus receptor-​binding site   class 2 antibody responses, which are more susceptible         NTD substitutions suggests that lineage B.1.351 may
and its cellular receptor.       to substitutions such as E484K44,58,59. In addition to their   overcome the polyclonal antibody response by reducing

                                                                                                                                  www.nature.com/nrmicro

                                                                   0123456789();:
Reviews

IC50
                                   neutralization by class 1 and class 2 RBD-​specific anti-       between December 2020 and February 2021, indicating
The half-​maximal inhibitory       bodies and NTD-​specific antibodies (Fig. 5b) . Data            that this amino acid substitution is probably the result
concentration, a quantitative      reported in one study showed that nearly half of exam-          of viral adaptation due to increasing immunity in the
measure that indicates             ined convalescent plasma samples (21 of 44; 48%) had            population75. L452R is also present in the A.27 lineage
how much of an inhibitory
                                   no detectable neutralization activity against the B.1.351       associated with a cluster of cases identified on the island
substance (for example,
postvaccination serum) is          variant58. Other experiments with pseudotyped viruses           of Mayotte76. The lineage B.1.526 has been found to carry
required to inhibit a biological   showed that the B.1.351 variant was also resistant to the       either S477N or E484K, among other lineage-​defining
process (for example, virus        neutralizing activity of some mAbs (12 of 17; 70%)67.           mutations77,78, both of which were described as antigen-
forming plaques or regions
                                   Similarly, a study showed that the neutralizing effect of       ically important above. A new variant carrying E484K
of infected cells in culture)
by 50%.
                                   convalescent plasma collected from 14 individuals was           currently designated A.VOI.V2 was recently identified
                                   strongly reduced against a live (authentic) B.1.351 virus       in Angola from cases involving travel from Tanzania79.
                                   (with IC50 reduced by 3.2-​fold to 41.9-​fold relative to the   This variant carries several amino acid substitutions in
                                   first-​wave virus)68.                                           the spike protein and three deletions in the NTD, some
                                       The P.1 lineage, a sublineage of B.1.1.28, was first        of which are within the antigenic supersite79. Another
                                   detected in Brazil69 and in travellers from Brazil to           variant within the A lineage, the prevalence of which is
                                   Japan70. Lineage P.1 is characterized by the presence           rising in Uganda (A.23.1), shares with the B.1.1.7 lineage
                                   of several amino acid substitutions in the spike pro-           a substitution at position 681 within the furin cleavage
                                   tein: L18F, T20N, P26S, D138Y, R190S, K417T, E484K,             site (P681R has been found in the A lineage, whereas
                                   N501Y, H655Y and T1027I69. In addition to substitutions         P681H has been found in the B.1.1.7 lineage), and addi-
                                   at positions 417, 484 and 501 discussed above, the P.1 lin-     tionally has the amino acid substitutions R102I, F157L,
                                   eage has a cluster of substitutions close to the described      V367F and Q613H. Q613H is speculated to be impor-
                                   antigenic regions of the NTD, including L18F, which is          tant as it occurs at a position neighbouring D614G80.
                                   known to reduce neutralization by some antibodies30.            Amino acid position 157 has been identified as an
                                   The substitutions T20N and P26S also occur in or                epitope residue, with F157A reducing neutralization by
                                   near the NTD supersite30 at residues with high anti-            the mAb 2489 (ref.34).
                                   body accessibility scores (Fig. 5b), and T20N introduces             New variants will continue to emerge, and although
                                   a potential glycosylation site that could result in glycan      it is important to understand the phenotypes of emerg-
                                   shielding (Box 2) of part of the supersite. The P.1 line-       ing variants in terms of infectivity, transmissibility, vir-
                                   age has also been associated with a reinfection case in         ulence and antigenicity, it is also important to quantify
                                   Manaus, Brazil27, indicating it is contributing to anti-        the phenotypic impacts of specific mutations present
                                   genic circumvention of what might have been an oth-             in variants, both individually and in combination with
                                   erwise effective immune response. Analyses integrating          other mutations. As new variants with unforeseen com-
                                   genomic and mortality data estimate that P.1 may be             binations of mutations continue to emerge, such insights
                                   1.7 to 2.4-​fold more transmissible and that previous           will allow predictions of virus phenotype. For example,
                                   infection by non-​P.1 SARS-​C oV-2 provides 54–79%              recently detected viruses of lineage B.1.617.1 were antici­
                                   of the protection against P.1 infection compared with           pated to show altered antigenicity due to the presence of
                                   non-​P.1 lineages71. More details of the frequency and          the substitutions L452R and E484Q, which have been
                                   geographic distribution of the P1 lineage can be found          described as affecting antibody recognition39,43,45,48,81.
                                   at the Pango lineages website72.                                Moving forwards, the experimental characterization
                                       Increasingly, lineages possessing independent               of SARS-​CoV-2 spike mutations to date will continue
                                   occurrences of mutations in common with the variants            to provide extremely useful information on individual
                                   of concern B.1.1.7, B.1.351 and P.1 are being detected,         mutations or combinations of mutations that may not
                                   demonstrating convergent evolution. For example,                yet have been seen in circulating viruses.
                                   viruses of lineage B.1.525, which has been observed in
                                   several countries, albeit at low frequency to date, have        Vaccine efficacy against new variants
                                   NTD deletions ΔH69–V70 and ΔY144 in common with                 To date, vaccines have been licensed and rolled out very
                                   viruses of the B.1.1.7 lineage; E484K in common with the        successfully in several countries, but the number of indi-
                                   B.1.351 and P.1 lineages; and spike amino acid substitu-        viduals vaccinated still represents a small fraction of the
                                   tions Q52R, Q677H and F888L73. Repeated amino acid              global population (Supplementary Table 1). To assess
                                   substitutions at position 677 and the independent emer-         the impacts of mutations on vaccine efficacy, authentic
                                   gence of Q677H in several lineages in the USA provides          viruses and pseudoviruses possessing particular spike
                                   strong evidence of adaptation, potentially through an           mutations (either individually or in combination) and
                                   effect of this mutation on the proximal polybasic furin         larger sets of mutations representing variants of concern
                                   cleavage site, although further experiments are required        and other circulating spike mutations have been assessed
                                   to determine its impact74. Other novel variants have            by neutralization assays with postvaccination sera
                                   been identified spreading in California and New York,           (Supplementary Table 1). Typically, studies report a fold
                                   USA (B.1.427 and B.1.429, and B.1.526, respectively).           change in variant virus, or pseudovirus, neutralization rel-
                                   The B.1.427 and B.1.429 variants carry an antigeni-             ative to wild-​type virus (the serum concentration at which
                                   cally noteworthy substitution, L452R75, which has been          50% neutralization (IC50) is achieved with the variant
                                   shown to reduce neutralization by several mAbs43,45,48,59       divided by the average IC50 for the wild-​type virus).
                                   and convalescent plasma 43. L452R independently                     Postvaccination sera from a cohort of 20 volun-
                                   appeared in several other lineages around the globe             teers immunized with the mRNA vaccine mRNA-1273

Nature Reviews | Microbiology

                                                                   0123456789();:
You can also read