Dear Editor,

By the end of June 2020, the pandemic of coronavirus diseases 2019 (COVID-19) had resulted in more than 10 million individuals, all over the world, being infected with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).1 The high contagiousness of SARS-CoV-2 virus was largely attributed to the relatively unique sequence composition in its spike (S) glycoprotein, which is in charge of the host entry by interacting with its cellular receptor angiotensin-converting enzyme 2 (ACE2).2,3 The S glycoprotein can be processed into an N terminal S1 fragment that is responsible for receptor binding, and a C terminal S2 fragment that functions to promote membrane fusion.4 SARS-CoV-2 is a type of single-stranded positive-sense ribonucleic acid (RNA) virus, and an increasing number of mutations were identified across the SARS-CoV-2 genome, including the region encoding S glycoprotein. Nevertheless, the functional implications of these mutations remain largely unknown, we here reported the mutation analysis of S gene and a functional exploration of the dominant D614G mutation.

In total, 9002 S gene sequences were extracted from the high-quality complete genomes deposited in the GISAID EpiCoV database as of April 28, 2020. Sequence alignment identified 6253 non-synonymous mutations at 82 different sites of the S gene (Supplementary Table S1 and Fig. 1a). While the majority of the mutations (for 81 sites) were in low frequency (<0.6%), the D614G stood out as a prominent mutation, accounting for about 90% (5583/6253) mutations identified (Supplementary Table S1), with a site-specific mutation frequency of >62% (5583/9002) (Fig. 1a), suggesting potential impacts on the function of S protein.

Fig. 1
figure 1

Enhanced infectivity of D614G mutant of SARS-CoV-2 virus. a The mutation profile of SARS-CoV-2 spike glycoprotein. From a set of 9002 SARS-CoV-2 genome sequences, mutations were identified at 82 positions across the gene encoding S glycoprotein with the D614G being the dominant mutation (>62%); for the plot of mutation counts (the left Y axis), only those showing more than 5 counts were displayed. b Three-dimensional structure modeling of the D614 (upper) and G614 (lower) S glycoprotein colored by chain. The α subunit in red is on the left, the neighboring β subunit in green is on the right. The D614, G614, and T859 residues are displayed in the style of scaled ball and stick with simulated surfaces in yellow. c Detection of the expression of the indicated proteins by Western blot. The spike protein was blotted by an antibody against the S2 region. d, e Altered expression of spike trimer in G614 mutant. d Expression of two spike proteins in the absence and presence of ACE2 in 293T cells at the time points of 12 and 48-h post transfection by Western blot. e Quantification of the relative expression of spike trimer and monomer with the respective D614 expression in the absence of ACE2 as the reference. All results were normalized by the expression of β-actin. Data are mean ± SD of triple quantification. f, g Quantification of syncytia formation upon expression of the indicated S glycoprotein in 293T-ACE2 cells (f) and Hela-ACE2 cells (g). Data are the mean ± SD of results from 4–5 fields (20x objective lens). ***p < 0.001; ****p < 0.0001. More than three replicates were done for the experiment. h, i The expression of the luciferase reporter in 293T-ACE2 cells (h) and Hela-ACE2 cells (i) upon infection of viruses pseudotyped with D614 or G614 S glycoproteins as indicated. Data are the mean ± SD of the results of quadruplicate. Fold changes between D614 and G614 genotypes are displayed. More than three replicates were done for the experiment

To explore the potential effects of the D614G mutation on the spatial structure of S glycoprotein, we performed three-dimensional (3D) modeling with the template of published SARS-CoV-2 S structure (6vxx.pdb) by the Modeling algorithm at SWISS-MODEL. The native S protein trimer was comprised of 3 identical subunits, the Aspartic acid residue at position 614 (D614) of wild type S protein located at a region interfacing with the neighboring subunit (Fig. 1b, left α subunit in red), and was spatially close to and potentially interacted with the T859 residue of neighboring subunit by an inter-subunit hydrogen bond (Fig. 1b, left β subunit in green). This potential inter-subunit interaction mediated by D614-T859 was disrupted by the D to G mutation due to the loss of the side chain, which eliminated the inter-subunit hydrogen bond and may increase the flexibility of the main chain, as indicated by the surface simulation of T859, V860, L861 residues of the neighboring subunit (Fig. 1b, left β subunit in yellow). Meanwhile, the backbone amine of the glycine (G) residue shortened the hydrogen bond with the carboxyl group of neighboring amino acids (A647) within the same subunit and strengthened it, which might locally stabilize the S protein. Thus, the D614G mutation may conceivably affect the stability of the S protein trimer and its related functions, such as host entry and viral infectivity.

In order to examine the potential functional outcomes of the D614G mutation, we made two constructs expressing the wild type (D614) and mutant (G614) SARS-CoV-2 S glycoprotein, respectively. Transfection of the same amount of these two constructs into 293T cells expressing ACE2 (293T-ACE2) effectively expressed full length (FL) S protein (Fig. 1c) at both 24 and 48-h post-transfection as detected by an antibody against the S2 region. Whereas, increased S2 production, which is resulted from protease-mediated cleavage of S protein, was observed in the G614 mutant as compared with that in the wild type D614 transfectant, the difference was even more obvious for transfectants of 48-h post-transfection (Fig. 1c), suggesting that the D614G mutation enhanced S protein processing, which is consistent with the idea that the D614G mutation may affect the stability of the S protein trimer as suggested above based on structural analysis (Fig. 1b). Interestingly, the expression of the super-shifted band above the full length S protein, representing for the trimer, was higher in G614 than D614 in the absence of ACE2 at both 12 and 48-h post-transfection, but was lower in G614 than D614 in the presence of ACE2 at 12-h post-transfection, and the difference disappeared at 48-h post-transfection, probably due to the saturated effects from ACE2 which could clearly promote the accumulation of overexpressed S protein (Fig. 1d, e). The results fit well with a bimodular effect of D614G mutation on S-trimer stability depending on ACE2 engagement, i.e., G614 mutation increased S-trimer stability on virion in the absence of ACE2, but promoted S-trimer dissociation, upon engaged with ACE2, to expose the S2 fragment (Fig. 1c), which conceivably contributed to high contagiousness.

Since SARS-CoV-2 infection could induce membrane fusion (Supplementary Fig. S1) via S expression, the increased S2 production may predict the enhanced ability of the D614G mutant in inducing membrane fusion. To test this hypothesis, we took the advantages of syncytium formation assay as a surrogate of S protein-induced membrane fusion. As shown in Supplementary Fig. S2, a number of syncytia were formed in 293T-ACE2 cells 24-h post-transfection of wild type D614 S construct, while the G614 mutant exhibited significantly stronger induction of syncytium formation as indicated by higher fusion index calculated as the percentage of nuclei in syncytia (Fig. 1f), the difference maintained at 48-h post-transfection, when extensive syncytium formation had occurred (Supplementary Fig. S2 and 1f). the effect of D614G on syncytium formation was also confirmed in Hela cells engineered to stably express ACE2 cells (Fig. 1g). These results indicate that the D614G mutation may endow enhanced fusion ability to the SARS-CoV-2 S glycoprotein.

To examine whether the enhanced fusion ability could be translated into enhanced virus infection, we made viruses pseudotyped with wild type and G614 S proteins, and infected 293T-ACE2 cells and Hela-ACE2 cells with the same amount of virus particles. As shown in Fig. 1h, i, the viruses pseudotyped with G614 mutant displayed luciferase activities about 3.1 ~ 4.2 times higher than those pseudotyped with wild type (D614) S protein by the time of 24-h post infection; and the fold changes increased to 5.2 ~ 5.6 by the time of 48-h post infection. Moreover, the luciferase activities positively correlated with membrane fusion indexes (Fig. 1f–i). Thus, these data are consistent with the notion that the D614G mutation enhanced SARS-CoV-2 infectivity via membrane fusion-mediated host entry.

Interestingly, the Hela-ACE2 cells, but not 293T-ACE2 cells, expressed considerable amounts of TMPRSS2, which was also involved in the phenotypes mediated by S-G614 as the treatment of Camostat, a TMPRSS2 inhibitor, compromised S-G614-induced phenotypes (Supplementary Fig. S3).

Consistent with our finding, a recent study by Kober et al.5 reported that the SARS-CoV-2 variant containing the D614G mutation, initially beginning spreading in Europe, rapidly became a dominant form in regions where it was introduced. And the recurrent pattern of the increase occurred at multiple geographic levels, indicating high efficacy of transmission. Together with our findings, it is proposed that the SARS-CoV-2 variant of G614 genotype is highly transmissible, which is probably attributed to an increased S2 production associated with bi-modularly altered S-trimer stability, leading to enhanced membrane fusion and host entry. Therefore, particular attention should be paid to the emergence of this variant in a region to help prevent recurrent COVID-19 outbreak.