Genomic reconstruction of the SARS-CoV-2 epidemic in England

Vöhringer, Harald S.; Sanderson, Theo; Sinnott, Matthew; De Maio, Nicola; Nguyen, Thuy; Goater, Richard; Schwach, Frank; Harrison, Ian; Hellewell, Joel; Ariani, Cristina V.; Gonçalves, Sonia; Jackson, David K.; Johnston, Ian; Jung, Alexander W.; Saint, Callum; Sillitoe, John; Suciu, Maria; Goldman, Nick; Panovska-Griffiths, Jasmina; Birney, Ewan; Volz, Erik; Funk, Sebastian; Kwiatkowski, Dominic; Chand, Meera; Martincorena, Inigo; Barrett, Jeffrey C.; Gerstung, Moritz

doi:10.1038/s41586-021-04069-y

Download PDF

Article
Open access
Published: 14 October 2021

Genomic reconstruction of the SARS-CoV-2 epidemic in England

Nature volume 600, pages 506–511 (2021)Cite this article

31k Accesses
50 Citations
949 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 07 June 2022

This article has been updated

Abstract

The evolution of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus leads to new variants that warrant timely epidemiological characterization. Here we use the dense genomic surveillance data generated by the COVID-19 Genomics UK Consortium to reconstruct the dynamics of 71 different lineages in each of 315 English local authorities between September 2020 and June 2021. This analysis reveals a series of subepidemics that peaked in early autumn 2020, followed by a jump in transmissibility of the B.1.1.7/Alpha lineage. The Alpha variant grew when other lineages declined during the second national lockdown and regionally tiered restrictions between November and December 2020. A third more stringent national lockdown suppressed the Alpha variant and eliminated nearly all other lineages in early 2021. Yet a series of variants (most of which contained the spike E484K mutation) defied these trends and persisted at moderately increasing proportions. However, by accounting for sustained introductions, we found that the transmissibility of these variants is unlikely to have exceeded the transmissibility of the Alpha variant. Finally, B.1.617.2/Delta was repeatedly introduced in England and grew rapidly in early summer 2021, constituting approximately 98% of sampled SARS-CoV-2 genomes on 26 June 2021.

COVID-19 in Amazonas, Brazil, was driven by the persistence of endemic lineages and P.1 emergence

Article 25 May 2021

The phylodynamics of SARS-CoV-2 during 2020 in Finland

Article Open access 10 June 2022

Genomic epidemiology reveals multiple introductions of SARS-CoV-2 from mainland Europe into Scotland

Article 21 December 2020

Main

The SARS-CoV-2 virus accumulates approximately 24 point mutations per year, or 0.3 mutations per viral generation^1,2,3. Most of these mutations appear to be evolutionarily neutral but, as the SARS-CoV-2 epidemic spread around the world during spring 2020, it became apparent that the virus is continuing to adapt to its human host. An initial sign was the emergence and global spread of the spike protein variant D614G in the second quarter of 2020. Epidemiological analyses estimated that this mutation, which defines the B.1 lineage, confers a 20% transmissibility advantage over the original A lineage that was isolated in Wuhan, China⁴.

A broad range of lineages have been defined since that can be used to track SARS-CoV-2 transmission across the globe^5,6. For example, B.1.177/EU-1 emerged in Spain in early summer 2020 and spread across Europe through travel⁷. Subsequently, four variants of concern (VOCs) have been identified by the WHO and other public health authorities: the B.1.351/Beta lineage was discovered in South Africa⁸, where it spread rapidly in late 2020. The B.1.1.7/Alpha lineage was first observed in Kent in September 2020 (ref. ⁹) from where it swept through the United Kingdom and large parts of the world due to a 50–60% increase^10,11,12,13 in transmissibility. P.1/Gamma originated in Brazil^14,15 and has spread throughout South America. Most recently, B.1.617.2/Delta was associated with a large surge of coronavirus disease 2019 (COVID-19) in India in April 2021 and subsequently around the world.

Epidemiology of SARS-CoV-2 in England

In the United Kingdom, by late June 2021 the COVID-19 Genomics UK Consortium (COG-UK) had sequenced close to 600,000 viral samples. These data have enabled a detailed reconstruction of the dynamics of the first wave of the epidemic in the United Kingdom between February and August 2020 (ref. ¹⁶). Here we leverage a subset of those data—genomic surveillance data generated at the Wellcome Sanger Institute—to characterize the growth rates and geographical spread of different SARS-CoV-2 lineages and reconstruct how newly emerging variants changed the course of the epidemic.

Our data cover England between 1 September 2020 and 26 June 2021, encompassing three epidemic waves and two national lockdowns (Fig. 1a). In this time period, we sequenced 281,178 viral genomes, corresponding to an average of 7.2% (281,178/3,894,234) of all of the positive tests from PCR testing for the wider population, ranging from 5% in winter 2020 to 38% in early summer 2021, and filtered to remove cases that were associated with international travel (Methods and Extended Data Fig. 1a, b). Overall, a total of 328 SARS-CoV-2 lineages were identified using the PANGO lineage definition⁵. As some of these lineages were only rarely and intermittently detected, we collapsed these on the basis of the underlying phylogenetic tree into a set of 71 lineages for modelling (Fig. 1b–d and Supplementary Tables 1 and 2).

**Fig. 1: SARS-CoV-2 surveillance sequencing in England between September 2020 and June 2021.**

These data reveal a diversity of lineages in the fall of 2020 followed by sweeps of the Alpha and Delta variants (Fig. 1b and Supplementary Tables 2 and 3). Figure 1c shows the geographical distribution of cases and of different lineages, studied at the level of 315 English lower tier local authorities (LTLAs), administrative regions with approximately 100,000–200,000 inhabitants.

Modelling the dynamics of SARS-CoV-2

We developed a Bayesian statistical model that tracks the fraction of genomes from different lineages in each LTLA in each week and fits the daily total number of positive Pillar 2 tests (Methods and Extended Data Fig. 2). The multivariate logistic regression model is conceptually similar to previous approaches in its estimation of relative growth rates^10,11. It accounts for differences in the epidemiological dynamics between LTLAs, and enables the introduction of new lineages (Fig. 2a–c). Despite the sampling noise in a given week, the fitted proportions recapitulate the observed proportions of genomes as revealed by 35 example LTLAs covering the geography of England (Fig. 2b, c and Supplementary Notes 1 and 2). The quality of fit is confirmed by different probabilistic model selection criteria (Extended Data Fig. 3) and also evident at the aggregated regional level (Extended Data Fig. 4).

**Fig. 2: Spatiotemporal model of 71 SARS-CoV-2 lineages in 315 English LTLAs between September 2020 and June 2021.**

Although the relative growth rate of each lineage is modelled as identical across LTLAs, the local viral proportions change dynamically due to the timing and rate of introduction of different lineages. The model also calculates total and lineage-specific local incidences and time-dependent growth rates and approximate reproduction numbers R_t by negative binomial spline fitting of the number of daily positive PCR tests (Methods, Fig. 2d and Extended Data Fig. 2c). Together, this enables a quantitative reconstruction of different periods of the epidemic, which we will discuss in chronological order.

Multiple subepidemics in autumn 2020

Autumn 2020 was characterized by a surge of cases—concentrated in the north of England—that peaked in November, triggering a second national lockdown (Fig. 1a, c). This second wave initially featured B.1 and B.1.1 sublineages, which were slightly more prevalent in the south and north of England, respectively (Fig. 2b, c). Yet, the proportion of B.1.177 and its geographically diverse sublineages steadily increased across LTLAs from around 25% at the beginning of September to 65% at the end of October. This corresponds to a growth rate of between 8% (growth per 5.1 d; 95% confidence interval (CI) = 7–9%) and 12% (95% CI = 11–13%) greater than that of B.1 or B.1.1. The trend of B.1.177 expansion relative to B.1 persisted throughout January (Extended Data Fig. 5a) and involved a number of monophyletic sublineages that arose in the UK, and similar patterns were observed in Denmark¹⁷ (Extended Data Fig. 5b). Such behaviour cannot easily be explained by international travel, which was the major factor in the initial spread of B.1. throughout Europe in summer 2020 (ref. ⁷). However, the underlying biological mechanism is unclear as the characteristic A222V spike variant is not believed to confer a growth advantage⁷.

The spread of Alpha during restrictions

The subsequent third wave from December 2020 to February 2021 was almost exclusively driven by Alpha/B.1.1.7, as described previously^10,11,18. The rapid sweep of Alpha was due to an estimated transmissibility advantage of 1.52 compared with B.1.1 (growth per 5.1 d; 95% CI = 1.50–1.55; Fig. 2a), assuming an unchanged generation interval distribution¹⁹. The growth advantage is thought to stem, at least in part, from spike mutations that facilitate ACE2 receptor binding (N501Y)^20,21 and furin cleavage (P681H)²². Alpha grew during a period of restrictions, which proved to be insufficient to contain its spread (Fig. 3a).

**Fig. 3: Growth of B.1.1.7/Alpha and other lineages in relation to lockdown restrictions between November 2020 and March 2021.**

The second national lockdown from 5 November to 1 December 2020 successfully reduced the total number of cases, but this masked a lineage-specific increase (R_t > 1; defined as growth per 5.1 d) in Alpha and a simultaneous decrease in other hitherto dominant lineages (R_t < 1) in 78% (246/315) of LTLAs²³ (Fig. 3b, c). This pattern of Alpha-specific growth during lockdown is supported by a model-agnostic analysis of raw case numbers and proportions of Alpha genomes (Fig. 3e).

Three levels of regionally tiered restrictions were introduced in December 2020 (ref. ²⁴) (Fig. 3a). The areas under different tiers of restrictions visibly and quantitatively coincide with the resulting local R_t values, with greater R_t values in areas with lower restrictions (Fig. 3a–c).The reopening caused a surge of cases across all tiers with R_t > 1, which is also evident in selected time series (Fig. 3d). As Alpha cases surged, more areas were placed under tier 3 restrictions, and stricter tier 4 restrictions were introduced. Nevertheless, Alpha continued to grow (R_t > 1) in most areas, presumably driven by increased social interaction over Christmas (Fig. 3c).

After the peak of 72,088 daily cases on 29 December 2020 (Fig. 1a), a third national lockdown was announced on 4 January 2021 (Fig. 3a). The lockdown and increasing immunity derived from infection and increasing vaccination²⁵ led to a sustained contraction of the epidemic to approximately 5,500 daily cases by 8 March, when restrictions began to be lifted by reopening schools (further steps of easing occurred on 12 April and 17 May). In contrast to the second national lockdown 93% (296/315) of LTLAs exhibited a contraction in both Alpha and other lineages (Fig. 3e).

Elimination of lineages in early 2021

The lineage-specific rates of decline during the third national lockdown and throughout March 2021 resulted in large differences in lineage-specific incidence. Cases of Alpha contracted nationally from a peak of around 50,000 daily new cases to approximately 2,750 on 1 April 2021 (Fig. 4a). At the same time, B.1.177—the most prevalent lineage in November 2020—fell to less than an estimated 10 cases per day. Moreover, the incidence of most other lineages present in autumn 2020 was well below 1 after April 2021, implying that the majority of them have been eliminated. The number of observed distinct PANGO lineages declined from a peak of 137 to only 22 in the first week of April 2021 (Fig. 4b). Although this may be attributed in part to how PANGO lineages were defined, we note that the period of contraction did not replenish the genetic diversity lost due to the selective sweep by Alpha (Extended Data Fig. 6).

**Fig. 4: Elimination of SARS-CoV-2 lineages during spring 2021.**

Refractory variants with E484K mutations

Parallel to the elimination of many formerly dominant SARS-CoV-2 lineages, a number of new variants were imported or emerged (Fig. 4a). These include the VOCs B.1.351/Beta and P.1/Gamma, which carry the spike variant N501Y that is also found in B.1.1.7/Alpha and a similar pair of mutations (K417N/T and E484K) that were each shown to reduce the binding affinity of antibodies from vaccine-derived or convalescent sera^{20,26,27,28,29} . The ability to escape from previous immunity is consistent with the epidemiology of Beta in South Africa⁸ and especially the surge of Gamma in Manaus¹⁵. The variants B.1.525/Eta, B.1.526/Iota, B.1.1.318 and P.2/Zeta also harbour E484K spike mutations as per their lineage definition, and sublineages of Alpha and A.23.1 that acquired E484K were found in England (Fig. 5a, b).

**Fig. 5: Dynamics of E484K variants and Delta between January and June 2021.**

The proportion of these E484K-containing variants was consistently 0.3–0.4% from January to early April 2021. A transient rise, especially of the Beta and Gamma variants, was observed in May 2021 (Fig. 5a, b). Yet, the dynamics were largely stochastic and characterized by a series of individual and localized outbreaks, possibly curtailed by local surge testing efforts against Beta and Gamma variants (Fig. 5c). Consistent with the transient nature of these outbreaks, the estimated growth rates of these variants were typically lower than Alpha (Fig. 2a).

Sustained imports from international travel were a critical driving mechanism behind the observed number of non-Alpha cases. A phylogeographical analysis establishing the most parsimonious sets of monophyletic and exclusively domestic clades, which can be interpreted as individual introductions, confirmed that A.23.1 with E484K (1 clade) probably has a domestic origin as no genomes of the same clade were observed internationally (Methods, Fig. 5d and Extended Data Fig. 7). The estimated number of introductions was lowest for B.1.1.318 (3 introductions, range = 1–6), and highest for Beta (49 introductions, range = 45–58) and Eta (30 introductions, range = 18–34). Although our data exclude genomes sampled directly from travellers, these repeated introductions show that the true rate of transmission is lower than the observed increase in the number of surveillance genomes.

The rise of Delta from April to June 2021

The B.1.617.1/Kappa and B.1.617.2/Delta lineages, which were first detected in India in 2020, first appeared in English surveillance samples in March 2021. In contrast to other VOCs, Delta/Kappa do not contain N501Y or E484K mutations, but their L452R mutation may reduce antibody recognition²⁷ and P681R enhances furin cleavage³⁰, similar to the P681H mutation of Alpha. The frequency of Delta, which harbours further spike mutations of unknown function, increased rapidly and reached levels of 98% (12,474/12,689) on 26 June 2021 (Fig. 5a, b). Although initially constrained to a small number of large local clusters, such as in Bolton, in May 2021 (Fig. 5c), Delta was detected in all LTLAs by 26 June 2021 (Fig. 1c). The sweep of Delta occurred at a rate of around 59% (growth per 5.1 d, CI = 53–66) higher than Alpha with minor regional variation (Fig. 2a, Extended Data Fig. 4e and Supplementary Table 4).

The rapid rise of Delta contrasts with Kappa, which grew more slowly despite being introduced at a similar time and into a similar demographic background (Figs. 2a and 5b). This is also evident in the phylogeographical analysis (based on data as of 1 May 2021). The 224 genomes of Delta derive from larger clades (23 introductions, range = 6–40; around 10 genomes for every introduction) compared with the 80 genomes of Kappa (17 introductions, range = 15–31; around 3–4 genomes per introduction) and also other variants (Fig. 5d and Extended Data Fig. 8). The AY.1 lineage, derived from Delta and containing an additional K417N mutation, appeared only transiently (Fig. 5b).

The sustained domestic growth of Delta and its international spread³¹ relative to the Alpha lineage are the first evidence of a biological growth advantage. The causes appear to be a combination of increased transmissibility and immune evasion. Evidence for higher transmissibility includes the fast growth in younger unvaccinated age groups, reports of elevated secondary attack rates³² and a higher viral load³³. Furthermore, vaccine efficacy against infection by Delta is diminished, depending on the type of vaccine^34,35, and reinfection is more frequent³⁶, both supported by experimental research demonstrating the reduced antibody neutralization of Delta by vaccine-derived and convalescent sera^37,38.

The higher growth rate of Delta—combined with gradual reopening and proceeding vaccination—repeated the dichotomous pattern of lineage-specific decline and growth, although now with declining Alpha (R_t < 1) and growing Delta (R_t > 1; Fig. 5e, f). Overall, we estimate that the spread of more transmissible variants between August 2020 and early summer 2021 increased the average growth rate of circulating SARS-CoV-2 in England by a factor of 2.39 (95% CI = 2.25–2.42; Fig. 5g). Thus, previously effective interventions may prove to be insufficient to contain newly emerging and more transmissible variants.

Discussion

Our dense genomic surveillance analysis identified lineages that consistently grew faster than others in each local authority and, therefore, at the same time, under the same restrictions and in a comparable population. This pinpointed a series of variants with elevated transmissibility, in broad agreement with other reports^{10,11,13,15,31}. However, a number of limitations exist. The growth rates of rare new variants are stochastic due to introductions and superspreading. Local outbreaks of the Beta and Gamma variants triggered asymptomatic surge testing, which may have reduced their spread. Furthermore, transmission depends both on the viral variant and the immunity of the host population, which changed from less than 20% to over 90% in the study period³⁹. This will influence the growth rates of variants with immune evasion capabilities over time. The effect of immunity is currently not modelled, but may become more important in the future as SARS-CoV-2 becomes endemic. Further limitations are discussed in the Limitations section of the Methods.

The third and fourth waves in England were each caused by more transmissible variants, which outgrew restrictions that were sufficient to suppress previous variants. During the second national lockdown, Alpha grew despite falling numbers for other lineages and, similarly, Delta took hold in April and May when cases of Alpha were declining. The fact that such growth was initially masked by the falling cases of dominant lineages highlights the need for dense genomic surveillance and rapid analysis to devise optimal and timely control strategies. Such surveillance should ideally be global as, even though Delta was associated with a large wave of cases in India, its transmissibility remained unclear at the time due to a lack of systematic genomic surveillance data.

The 2.4-fold increase in growth rate during the study period as a result of new variants is also likely to have consequences for the future course of the pandemic. If this increase in growth rate was explained solely by higher transmissibility, it would raise the basic reproduction number R₀ from a value of around 2.5–3 in spring 2020 (ref. ⁴⁰) to the range of 6–7 for Delta. This is likely to spur new waves of the epidemic in countries that have to date been able to control the epidemic despite low vaccination rates, and it may exacerbate the situation elsewhere. Although the exact herd-immunity threshold depends on contact patterns and the distribution of immunity across age groups^41,42, it is worth considering that Delta may increase the threshold to values around 0.85. Given current estimates of vaccine efficacy^34,35,43 this would require nearly 100% vaccination coverage. Even though more than 90% of adults had antibodies against SARS-CoV-2 (ref. ³⁹) and close to 70% had received two doses of vaccination, England saw rising Delta variant cases in the first weeks of July 2021. It can therefore be expected that other countries with high vaccination coverage are also likely to experience rising cases when restrictions are lifted.

SARS-CoV-2 is likely to continue its evolutionary adaptation process to humans⁴⁴. To date, variants with considerably higher transmissibility have had strongest positive selection, and swept through England during the 10 months of this investigation. However, the possibility that an increasingly immune population may now select for variants with better immune escape highlights the need for continued systematic and, ideally, global genomic surveillance.

Methods

Pillar 2 SARS-CoV-2 testing data

Publicly available daily SARS-CoV-2 test result data from testing for the wider population outside the National Health Service (Pillar 2 newCasesBySpecimenDate) were downloaded from https://coronavirus.data.gov.uk/ spanning the date range from 1 September 2020 to 30 June 2021 for 315 English LTLAs (downloaded on 20 July 2021). These data are mostly positive PCR tests, with about 4% of results from lateral flow tests without PCR confirmation. In this dataset, the City of London is merged with Hackney, and the Isles of Scilly are merged with Cornwall due to their small number of inhabitants, thereby reducing the number of English LTLAs from 317 to 315. Population data for each LTLA were downloaded from the Office of National Statistics (ONS; https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationestimates/datasets/populationestimatesforukenglandandwalesscotlandandnorthernireland).

SARS-CoV-2 surveillance sequencing

In total, 281,178 tests (September 2020 to June 2021) were collected as part of random surveillance of positive tests of residents of England from four Pillar 2 Lighthouse laboratories. The samples were collected between 1 September 2020 and 26 June 2021. A random selection of samples was taken, after excluding those that were known to be taken during quarantine of recent travellers, and samples from targeted and local surge testing efforts. The available metadata made this selection imperfect, but these samples should be an approximately random selection of infections in England during this time period, and the large sample size makes our subsequent inferences robust.

We amplified RNA extracts from these tests with C_t < 30 using the ARTIC amplicon protocol (https://www.protocols.io/workspaces/coguk/publications). We sequenced 384-sample pools on Illumina NovaSeq, and produced consensus fasta sequences according to the ARTIC nextflow processing pipeline (https://github.com/connor-lab/ncov2019-artic-nf). Lineage assignments were made using Pangolin⁵, according to the latest lineage definitions at the time, except for B.1.617, which we reanalysed after the designation of sublineages B.1.617.1, B.1.617.2 and B.1.617.3. Lineage prevalence was computed from 281,178 genome sequences. The genomes were mapped to the same 315 English LTLAs as for the testing data described above. Mapping was performed from outer postcodes to LTLA, which can introduce some misassignment to neighbouring LTLAs. Furthermore, lineages in each LTLA were aggregated to counts per week for a total of 43 weeks, defined beginning on Sunday and ending on Saturday.

Finally, the complete set of 328 SARS-CoV-2 PANGO lineages was collapsed into l = 71 lineages using the underlying phylogenetic tree, such that each resulting lineage constituted at least 100 genomes, unless the lineage has been designated a VOC, variant under investigation (VUI) or variant in monitoring by Public Health England³².

Spatiotemporal genomic surveillance model

A hierarchical Bayesian model was used to fit local incidence data in a given day in each local authority and jointly estimate the relative historical prevalence and transmission parameters. In the following, t denotes time and is measured in days. We use the convention that bold lowercase symbols, such as b, indicate vectors.

Motivation

Suppose that ${{\bf{x}}}^{{\prime} }(t)=({\bf{b}}+{r}_{0}(t))\cdot {\bf{x}}(t)$ describes the ordinary differential equation (ODE) for the viral dynamics for a set of l different lineages. Here r₀(t) is a scalar time-dependent logarithmic growth rate that is thought to reflect lineage-independent transmission determinants, which changes over time in response to behaviour, non-pharmaceutical interventions (NPIs) and immunity. This reflects a scenario in which the lineages differ only in terms of the intensity of transmission, but not the intergeneration time distribution. The ODE is solved by ${\bf{x}}(t)={{\rm{e}}}^{{\bf{c}}+{\bf{b}}t+{\int }_{{t}_{0}}^{t}{r}_{0}(t){\rm{d}}t}={{\rm{e}}}^{{\bf{c}}+{\bf{b}}t}\nu (t)$. The term ν(t) contributes the same factor to each lineage and therefore drops from the relative proportions of lineages ${\bf{p}}(t)=\frac{{\bf{x}}(t)}{\sum {\bf{x}}(t)}\propto {{\rm{e}}}^{{\bf{c}}+{\bf{b}}t}$.

In the given model, the lineage prevalence p(t) follows a multinomial logistic-linear trajectory. Moreover, the total incidence factorizes into ${\boldsymbol{\mu }}(t)=\nu (t)\sum {{\rm{e}}}^{{\bf{c}}+{\bf{b}}t}$, which provides a basis to separately estimate the total incidence µ(t) from Pillar 2 test data and lineage-specific prevalence p(t) from genomic surveillance data (which are taken from a varying proportion of positive tests). By using the equations above, one can subsequently calculate lineage-specific estimates by multiplying µ(t) with the respective genomic proportions p(t).

Incidence

In the following text, we describe a flexible semi-parametric model of the incidence. Let µ(t) be the expected daily number of positive Pillar 2 tests and s the population size in each of 315 LTLAs. Denote ${\boldsymbol{\lambda }}(t)=\,\log \,{\boldsymbol{\mu }}(t)-\,\log (s)$ the logarithmic daily incidence per capita at time t in each of the 315 LTLAs.

Suppose f(t) is the daily number of new infections caused by the number of people infected at time t. As new cases are noticed and tested only after a delay u with distribution g, the observed number of cases f ^*(t) will be given by the convolution

$${f}^{\ast }(t)={\int }_{0}^{{\rm{\infty }}}g(u)f(t-u){\rm{d}}u=(g\ast f)(t).$$

The time from infection to test is given by the incubation time plus the largely unknown distribution of the time from symptoms to test, which, in England, was required to take place within 5 d of symptom onset. To account for these factors, the log normal incubation time distribution from ref. ⁴⁶ is scaled by the equivalent of changing the mean by 2 d. The convolution shifts cases approximately 6 d into the future and also spreads them out according to the width of g (Extended Data Fig. 2a).

To parametrize the short- and longer-term changes of the logarithmic incidence ${\boldsymbol{\lambda }}(t)$, we use a combination of h weekly and k − h monthly cubic basis splines ${\bf{f}}(t)=({f}_{1}(t),\ldots ,{f}_{k}(t)).$ The knots of the h weekly splines uniformly tile the observation period except for the last 6 weeks.

Each spline basis function is convolved with the time to test distribution g, ${{\bf{f}}}^{\ast }(t)=({f}_{1}^{\ast }(t),\ldots ,{f}_{k}^{\ast }(t))$ as outlined above and used to fit the logarithmic incidence. The derivatives of the original basis f′(t) are used to calculate the underlying growth rates and R_t values, as shown further below. The convolved spline basis f*(t) is used to fit the per capita incidence in each LTLA as (Extended Data Fig. 2b):

$${\boldsymbol{\lambda }}(t)={\bf{B}}\times {{\bf{f}}}^{\ast }(t).$$

This implies that fitting the incidence function for each of the m local authorities is achieved by a suitable choice of coefficients ${\bf{B}}\in {{\mathbb{R}}}^{m\times k}$, that is one coefficient for each spline function for each of the LTLAs. The parameters B have a univariate normal prior distribution each, which reads for LTLA i and spline j:

$${{\bf{B}}}_{i,j}\sim N(0,{\sigma }_{j}).$$

The s.d. of the prior regularizes the amplitude of the splines and is chosen as ${\sigma }_{j}=0.2$ for weekly splines and ${\sigma }_{j}=1$ for monthly splines. This choice was found to reduce the overall variance resulting from the high number of weekly splines, meant to capture rapid changes in growth rates, but which can lead to instabilities particularly at the end of the time series, when not all effects of changes in growth rates are observed yet. The less regularized monthly splines reflect trends on the scale of several weeks and are therefore subject to less noise.

Finally, we introduce a term accounting for periodic differences in weekly testing patterns (there are typically 30% lower specimens taken on weekends; Fig. 1a):

$$\mathop{{\boldsymbol{\mu }}}\limits^{ \sim }={\boldsymbol{\mu }}(t)\cdot \delta (t),$$

where the scalar $\delta (t)=\delta (t-i\times 7)\,{\rm{\forall }}i\in {\mathbb{N}}$ and prior distribution $\delta (t)\sim {\rm{L}}{\rm{o}}{\rm{g}}{\rm{N}}{\rm{o}}{\rm{r}}{\rm{m}}{\rm{a}}{\rm{l}}(0,1)$ for $t=1,\ldots ,6$ and $\delta (0)=1$.

The total incidence was fitted to the observed number of positive daily tests X by a negative binomial with a dispersion $\omega =10$. The overdispersion buffers against non-Poissonian uncorrelated fluctuations in the number of daily tests.

$${\bf{X}}(t)\sim {\rm{N}}{\rm{B}}(\mathop{{\boldsymbol{\mu }}}\limits^{ \sim }(t),\omega ).$$

The equation above assumes that all elements of X(t) are independent, conditional on $\tilde{{\boldsymbol{\mu }}}(t)$.

Growth rates and R _t values

A convenient consequence of the spline basis of $\log ({\boldsymbol{\mu }})={\boldsymbol{\lambda }}$, is that the delay-adjusted daily logarithmic growth rate r(t) = λ′(t) of the local epidemic simplifies to:

$${\bf{r}}(t)={\bf{B}}\times {{\bf{f}}}^{{\prime} }(t),$$

where ${{\bf{f}}}_{j}^{{\prime} }(t)$ represents the first derivative of the jth cubic spline basis function.

To express the daily growth rate as an approximate reproductive number R_t, one needs to consider the distribution of the intergeneration time, which is assumed to be gamma distributed with mean 6.3 d (α = 2.29, β = 0.36)⁴⁶. The R_t value can be expressed as a Laplace transform of the intergeneration time distribution⁴⁷. Effectively, this shortens the relative time period because the exponential dynamics put disproportionally more weight on stochastically early transmissions over late ones. For reasons of simplicity and being mindful also of the uncertainties of the intergeneration time distribution, we approximate R_t values by multiplying the logarithmic growth rates with a value of ${\bar{\tau }}_{{\rm{e}}}$ = 5.1 d, which was found to be a reasonable approximation to the convolution required to calculate R_t values (denoted here by the lower case symbol ${\boldsymbol{\rho }}(t)$ in line with our convention for vector-variate symbols and to avoid confusion with the epidemiological growth rate r_t),

$$\log ({\boldsymbol{\rho }}(t))\approx \frac{{\rm{d}}\,\log ({\boldsymbol{\mu }}(t))}{{\rm{d}}t}{\bar{\tau }}_{{\rm{e}}}={\bf{r}}(t){\bar{\tau }}_{{\rm{e}}}$$

Thus, the overall growth rate scaled to an effective inter generation time of 5.1 d can be readily derived from the derivatives of the spline basis and the corresponding coefficients. The values derived from the approach are in very close agreement with those of the method of ref. ⁴⁸, but shifted according to the typical delay from infection to test (Extended Data Fig. 2b).

Genomic prevalence

The dynamics of the relative frequency P(t) of each lineage was modelled using a logistic-linear model in each LTLA, as described above. The logistic prevalence of each lineage in each LTLA is defined as ${\bf{L}}(t)={\rm{l}}{\rm{o}}{\rm{g}}{\rm{i}}{\rm{t}}({\bf{P}}(t))$. This is modelled using the piecewise linear expression

$${\bf{L}}(t)={\bf{C}}+{\bf{b}}\cdot {{\bf{t}}}_{+},$$

where b may be interpreted as a lineage-specific growth advantage and C as an offset term of dimension (LTLA × lineages). Time ${{\bf{t}}}_{+}$ is measured since introduction t₀ and is defined as

$${{\bf{t}}}_{+}=t-{{\bf{t}}}_{0}\,{\rm{i}}{\rm{f}}\,t > {{\bf{t}}}_{0}\,{\rm{e}}{\rm{l}}{\rm{s}}{\rm{e}}-{\rm{\infty }}$$

and accounts for the fact that lineages can be entirely absent prior to a stochastically distributed time period preceding their first observation. This is because, in the absence of such a term, the absence of a lineage prior to the point of observation can only be explained by a higher growth rate compared with the preceding lineages, which may not necessarily be the case. As the exact time of introduction is generally unknown, a stochastic three-week period of ${{\bf{t}}}_{0}\sim {\rm{Unif}}(-14,0)+{{\bf{t}}}_{0}^{{\rm{obs}}}$prior to the first observation ${{\bf{t}}}_{0}^{{\rm{obs}}}$ was chosen.

As the inverse logit transformation projects onto the l − 1 dimensional simplex ${S}_{l-1}$ and therefore loses one degree of freedom, B.1.177 was set as a baseline with

$${{\bf{L}}}_{\cdot ,0}(t)=0.$$

The offset parameters C are modelled across LTLAs as independently distributed multivariate normal random variables with a lineage-specific mean c and covariance $\Sigma =10\cdot {I}_{l-1}$, where ${I}_{l-1}$ denotes an $(l-1)\times (l-1)$ identity matrix. The lineage-specific parameters growth rate b and average offset c are modelled using IID Normal prior distributions

$${\bf{b}}\sim N(0,0.2)$$

$${\bf{c}}\sim N(-10,5)$$

The time-dependent relative prevalence P(t) of SARS-CoV2 lineages was fitted to the number of weekly genomes Y(t) in each LTLA by a Dirichlet-multinomial distribution with expectation ${\mathbb{E}}[{\bf{Y}}(t)]\approx {\bf{P}}(t)\cdot {\bf{G}}(t)$ where G(t) are the total number of genomes sequenced from each LTLA in each week. For LTLA i, this is defined as:

$${{\bf{Y}}}_{i,\cdot }(t)\sim {\rm{D}}{\rm{i}}{\rm{r}}{\rm{M}}{\rm{u}}{\rm{l}}{\rm{t}}({\alpha }_{0}+{{\boldsymbol{\alpha }}}_{1}{{\bf{P}}}_{i,\cdot }(t),{{\bf{G}}}_{i}(t)).$$

The scalar parameter ${{\boldsymbol{\alpha }}}_{0}=0.01$ can be interpreted as a weak prior with expectation 1/n, making the model less sensitive to the introduction of single new lineages, which can otherwise exert a very strong effect. Furthermore, the array ${{\boldsymbol{\alpha }}}_{1}=\frac{{\rm{cases}}}{2}$ increases the variance to account for the fact that, especially at high sequencing coverage (genomes ≈ cases), cases and therefore genomes are likely to be correlated and overdispersed as they may derive from a single transmission event. Other choices such as ${{\boldsymbol{\alpha }}}_{1}=1,000$, which make the model converge to a standard multinomial, leave the conclusions qualitatively unchanged. This model aspect is illustrated in Extended Data Fig. 2c.

Lineage-specific incidence and growth rates

From the two definitions above it follows that the lineage-specific incidence is given by multiplying the total incidence in each LTLA µ(t) with the corresponding lineage frequency estimate P(t)for lineage j at each time point

${{\bf{M}}}_{\cdot ,j}(t)={\boldsymbol{\mu }}(t)\cdot {{\bf{P}}}_{\cdot ,j}(t)$ for $j=0,\ldots ,l-1$

Further corresponding lineage-specific R_t values R(t) in each LTLA can be calculated from the lineage-agnostic average R_t value ρ(t) and the lineage proportions P(t) as

$$\log {\bf{R}}(t)=\,\log \,{\boldsymbol{\rho }}(t)+{\bar{\tau }}_{{\rm{e}}}({\bf{b}}-{\bf{P}}(t)\times {\bf{b}})$$

By adding the log-transformed growth rate fold changes b and subtracting the average log-transformed growth rate change ${\bf{P}}(t)\times {\bf{b}}$, it follows that ${{\bf{R}}}_{i,\cdot }(t)={{\bf{R}}}_{i,0}(t){{\rm{e}}}^{{\bar{\tau }}_{{\rm{e}}}{\bf{b}}}$, where ${{\bf{R}}}_{i,0}(t)$ is the R_t value of the reference lineage j = 0 (for which ${{\bf{b}}}_{0}=0$) in LTLA i. It follows that all other lineage-specific the R_t values are proportional to this baseline at any given point in time with factor ${{\rm{e}}}^{{\bar{\tau }}_{{\rm{e}}}{\bf{b}}}$.

Inference

The model was implemented in numpyro^49,50 and fitted using stochastic variational inference⁵¹. Guide functions were multivariate normal distributions for each row (corresponding to an LTLA) of B, C to preserve the correlations across lineages and time as well as for (b, c) to also model correlations between growth rates and typical introduction.

Phylogeographic analyses

To infer VOC introduction events into the UK and corresponding clade sizes, we investigated VOC genome sequences from GISAID (https://www.gisaid.org/) available from any country. We downloaded multiple sequence alignments of genome sequences with the release dates 17 April 2021 (for the analysis of the lineages A.23.1, B.1.1.318, B.1.351 andB.1.525) and 5 May 2021 (for the analysis of the B.1.617 sublineages). We next extracted a subalignment from each lineage (according to the 1 April 2021 version of PANGOlin for the 17 April 2021 alignment and the 23 April 2021 version of PANGOlin for the 5 May 2021 alignment) and, for each subalignment, we inferred a phylogeny through maximum likelihood using FastTree2 (v.2.1.11)⁵² with the default options and GTR substitution model⁵³.

On each VOC/VUI phylogeny, we inferred the minimum and maximum number of introductions of the considered SARS-CoV-2 lineage into the UK compatible with a parsimonious migration history of the ancestors of the considered samples; we also measured clade sizes for one specific example parsimonious migration history. We counted only introduction events into the UK that resulted in at least one descendant from the set of UK samples that we considered in this work for our hierarchical Bayesian model; similarly, we measured clade sizes by the number of UK samples considered here included in such clades. Multiple occurrences of identical sequences were counted as separate cases, as this helped us to identify rapid SARS-CoV-2 spread.

When using parsimony, we considered only migration histories along a phylogenetic tree that are parsimonious in terms of the number of migration events from and to the UK (in practice, we collapse all of the non-UK locations into a single one). Furthermore, as SARS-CoV-2 phylogenies present substantial numbers of polytomies, that is, phylogenetic nodes where the tree topology cannot be reconstructed due to a lack of mutation events on certain branches, we developed a tailored dynamic programming approach to efficiently integrate over all possible splits of polytomies and over all possible parsimonious migration histories. The idea of this method is somewhat similar to typical Bayesian phylogeographic inference⁵⁴ in that it enables us to at least in part integrate over phylogenetic uncertainty and uncertainty in migration history; however, it also represents a very simplified version of these analyses, more so than ref. ¹⁶, as it considers most of the phylogenetic tree as fixed, ignores sampling times and uses parsimony instead of a likelihood-based approach. Parsimony is expected to represent a good approximation in the context of SARS-CoV-2, due to the shortness (both in time and substitutions) of the phylogenetic branches considered^55,56. The main advantage of our approach is that, owing to the dynamic programming implementation, it is more computationally efficient than Bayesian alternatives, as the most computationally demanding step is the inference of the maximum likelihood phylogenetic tree. This enables us to infer plausible ranges for numbers of introduction events for large datasets and to quickly update our analyses as new sequences become available. The other advantage of this approach is that it enables us to easily customize the analysis and to focus on inferred UK introductions that result in at least one UK surveillance sample, while still making use of non-surveillance UK samples to inform the inferred phylogenetic tree and migration history. Note that possible biases due to uneven sequencing rates across the world⁵⁵ apply to our approach as well as other popular phylogeographic methods. Our approach works by traversing the maximum likelihood tree starting from the terminal nodes and ending at the root (postorder traversal). Here, we define a ‘UK clade’ as a maximal subtree of the total phylogeny for which all terminal nodes are from the UK, all internal nodes are inferred to be from the UK and at least one terminal node is a UK surveillance sample; the size of a UK clade is defined as the number of UK surveillance samples in it. At each node, using values already calculated for all children nodes (possibly more than two children in the case of a multifurcation), we calculate the following quantities: (1) the maximum and minimum number of possible descendant UK clades of the current node, over the space of possible parsimonious migration histories, and conditional on the current node being UK or non-UK; (2) the number of migration events compatible with a parsimonious migration history in the subtree below the current node, and conditional on the current node being UK or non-UK; (3) the size so far of the UK clade the current node is part of, conditional on it being UK; and (4) a sample of UK clade sizes for the subtree below the node. To calculate these quantities, for each internal node, and conditional on each possible node state (UK or non-UK), we consider the possible scenarios of having 0 of 1 migration events between the internal node and its children nodes (migration histories with more than 1 migration event between the node and its children are surely not parsimonious in our analysis and can be ignored).

To confirm the results of our analyses based on parsimony, we also used the new Bayesian phylogenetic approach Thorney BEAST¹⁶ (https://beast.community/thorney_beast) for VOCs for which it was computationally feasible, that is, excluding B.1.351. For each VOC, we used in Thorney BEAST the same topology inferred with FastTree2 as for our parsimony analysis; we also used treetime⁵⁷ v.0.8.2 to estimatea timed tree and branch divergences for use in Thorney BEAST. We used a two-state (UK and non-UK) migration model⁵⁴ of migration to infer introductions into the UK but again counted, from the posteriorsample trees, only UK clades with at least one UK surveillance sample.We used a Skygrid⁵⁸ tree coalescent prior with six time intervals. The comparison of parsimony and Bayesian estimates is shown in Extended Data Fig. 8d.

ONS infection survey analysis

Data from the cross-sectional infection survey were downloaded from https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/bulletins/coronaviruscovid19infectionsurveypilot/30april2021.

Comparison of ONS incidence estimates with hospitalization, case and death rates was conducted by estimating infection trajectories separately from observed cases, hospitalizations and deaths^59,60, convolving them with estimated PCR detection curves⁶¹, and dividing the resulting PCR prevalence estimates by the estimated prevalence from the ONS Community Infection Survey at the midpoints of the two-week intervals over which prevalence was reported in the survey.

Maps

Maps were plotted using LTLA shapefiles (https://geoportal.statistics.gov.uk/datasets/69dc11c7386943b4ad8893c45648b1e1), sourced from the ONS, which is licensed under the Open Government Licence v.3.0.

Limitations

A main limitation of the analysis is that the transmission model is deterministic, whereas the spread of variants is a stochastic process. Although the logistic growth assumption is a consistent estimator of the average transmission dynamics, individual outbreaks may deviate from these averages and therefore produce unreliable estimates.

Stochastic growth effects are accounted for only in terms of (uncorrelated) overdispersion and the offset at the time of the introduction. For these reasons, the estimated growth rates may not accurately reflect the viral transmissibility, especially at a low prevalence. It is therefore important to assess whether consistent growth patterns in multiple independent areas are observed. We note that the posterior distribution of the growth rates of rare variants tends to be biased to the baseline due to the centred prior.

In its current form, the model accounts for only a single introduction event per LTLA. Although this problem is in part alleviated by the high spatial resolution, which spreads introductions across 315 LTLAs, it is important to investigate whether sustained introductions inflate the observed growth rates, as in the case of the Delta variant or other VOCs and VUIs. This can be achieved by a more detailed phylogeographic assessment and through the assessment of monophyletic sublineages.

Furthermore, there is no explicit transmission modelled from one LTLA to another. As each introduction is therefore modelled separately, this makes the model conservative in ascertaining elevated transmission as single observed cases across different LTLAs can be explained by their introduction.

The inferred growth rates also cannot identify a particular mechanism of altered transmission. Biological mechanisms include a higher viral load, longer infectivity or greater susceptibility. Lineages could potentially differ by their intergeneration time, which would lead to nonlinear scaling. Here we did not find convincing evidence in incidence data for such effects, in contrast to previous reports²³. However, contact-tracing data indicate that the intergeneration time may be shortening for more transmissible lineages such as Delta^33,62. Cases of the Beta and Gamma VOCs may have been more intensely contact traced and triggered asymptomatic surge testing in some postcode areas. This may have reduced the observed growth rates relative to other lineages.

Lineages, such as Beta, Gamma or Delta also differ in their ability to evade previous immunity. As immunity changes over time, this might lead to a differential growth advantage over time. It is therefore advisable to assess whether a growth advantage is constant over periods in which immunity changes considerably.

A further limitation underlies the nature of lineage definition and assignment. The PANGO lineage definition⁵ assigns lineages to geographical clusters, which have by definition expanded, and this can induce a certain survivor bias, often followed by winner’s curse. Another issue results from the fact that very recent variants may not be classified as a lineage despite having grown, which can inflate the growth rate of ancestral lineages over sublineages.

As the total incidence is modelled on the basis of the total number of positive PCR tests, it may be influenced by testing capacity; the total number of tests approximately tripled between September 2020 and March 2021. This can potentially lead to a time trend in recorded cases and therefore baseline R_t values if the access to testing changed, for example, by too few tests being available tests during periods of high incidence, or changes to the eligibility to intermittently test with fewer symptoms. Generally, the observed incidence was in good agreement with representative cross-sectional estimates from the ONS^63,64, except for a period of peak incidence from late December 2020 to January 2021 (Extended Data Fig. 1d). Values after 8 March 2021 need to be interpreted with caution as Pillar 2 PCR testing was supplemented by lateral flow devices, which increased the number of daily tests to more than 1.5 million. Positive cases were usually confirmed by PCR and counted only once.

The modelled curves are smoothed over intervals of approximately 7 d using cubic splines, creating the possibility that later time points influence the period of investigation and cause a certain waviness of the R_t value pattern. An alternative parameterization using piecewise linear basis functions per week (that is, constant R_t values per week) leaves the overall conclusions and extracted parameters broadly unchanged.

Ethical approval

This study was performed as part of surveillance for COVID-19 under the auspices of Section 251 of the National Health Service Act 2006. It therefore did not require individual patient consent or ethical approval. The COG-UK study protocol was approved by the Public Health England Research Ethics Governance Group.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

Data availability

PCR test data are publicly available online (https://coronavirus.data.gov.uk/). A filtered, privacy conserving version of the lineage–LTLA–week dataset is publicly available online (https://covid19.sanger.ac.uk/downloads) and enables strong reproduction of our results, despite a small number of cells having been suppressed to avoid disclosure. Full SARS-CoV-2 genome data and geolocations can be obtained under controlled access from https://www.cogconsortium.uk/data/. Application for full data access requires a description of the planned analysis and can be initiated at coguk_DataAccess@medschl.cam.ac.uk. The data and a version of the analysis with fewer lineages can be interactively explored at https://covid19.sanger.ac.uk. Source data are provided with this paper.

Code availability

The genomic surveillance model is implemented in Python and available at GitHub (https://github.com/gerstung-lab/genomicsurveillance) and as a PyPI package (genomicsurveillance). Specific code for the analyses of this study can be found as individual Google colab notebooks in the same repository. These were run using Python v.3.7.1 (packages: matplotlib (v.3.4.1), numpy (v.1.20.2), pandas (v.1.2.3), scikit-learn (v.0.19.1), scipy (v.1.6.2), seaborn (v.0.11.1), jax (v.0.2.8), genomicsurveillance (v.0.4.0), numpyro (v.0.4.0)). The phylogeographic analyses were performed using Thorney Beast (v.0.1.1) and https://github.com/NicolaDM/phylogeographySARS-CoV-2. Code for the ONS infection survey analysis is available at GitHub (https://github.com/jhellewell14/ons_severity_estimates).

Change history

07 June 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41586-022-04887-8

References

Rambaut, A. Phylogenetic Analysis of nCoV-2019 Genomes (Virological, 2020); https://virological.org/t/phylodynamic-analysis-176-genomes-6-mar-2020/356
Nextstrain Team Genomic Epidemiology of Novel Coronavirus—Global Subsampling (Nextstrain, 2020); https://nextstrain.org/ncov/global?l=clock
Hadfield, J. et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 34, 4121–4123 (2018).
Article CAS PubMed PubMed Central Google Scholar
Volz, E. et al. Evaluating the effects of SARS-CoV-2 spike mutation D614G on transmissibility and pathogenicity. Cell 184, 64–75 (2021).
Article CAS PubMed PubMed Central Google Scholar
Rambaut, A. et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 5, 1403–1407 (2020).
Article CAS PubMed PubMed Central Google Scholar
O’Toole, Á. et al. Global Report Investigating Novel Coronavirus Haplotypes https://cov-lineages.org/global_report.html (2021).
Hodcroft, E. B. et al. Spread of a SARS-CoV-2 variant through Europe in the summer of 2020. Nature 595, 707–712 (2021).
Article ADS CAS PubMed Google Scholar
Tegally, H., Wilkinson, E., Lessells, R.J. et al. Sixteen novel lineages of SARS-CoV-2 in South Africa. Nat. Med. 27, 440–446 (2021).
Rambaut, A. et al. Preliminary Genomic Characterisation of an Emergent SARS-CoV-2 Lineage in the UK Defined by a Novel Set of Spike Mutations (Virological, 2020); https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563
Volz, E., Mishra, S., Chand, M. et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature 593, 266–269 (2021).
Davies, N. G. et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science 372, eabg3055 (2021).
O’Toole, Á. et al. Tracking the International Spread of SARS-CoV-2 Lineages B.1.1.7 and B.1.351/501Y-V2 (Virological, 2021); https://virological.org/t/tracking-the-international-spread-of-sars-cov-2-lineages-b-1-1-7-and-b-1-351-501y-v2/592
Washington, N. L. et al. Emergence and rapid transmission of SARS-CoV-2 B.1.1.7 in the United States. Cell 184, 2587–2594 (2021).
Article CAS PubMed PubMed Central Google Scholar
Faria, N. R. et al. Genomic Characterisation of an Emergent SARS-CoV-2 Lineage in Manaus: Preliminary Findings (Virological, 2021); https://www.icpcovid.com/sites/default/files/2021-01/Ep%20102-1%20Genomic%20characterisation%20of%20an%20emergent%20SARS-CoV-2%20lineage%20in%20Manaus%20Genomic%20Epidemiology%20-%20Virological.pdf
Faria, N. R. et al. Genomics and Epidemiology of the P.1 SARS-CoV-2 Lineage in Manaus, Brazil. Science 372, 815–21 (2021).
du Plessis, L. et al. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science 371, 708–712 (2021).
Article ADS PubMed Google Scholar
Danish Covid-19 Genome Consortium Genomic Overview of SARS-CoV-2 in Denmark (2021); https://www.covid19genomics.dk/statistics
Kraemer, M. U. G. et al. Spatiotemporal invasion dynamics of SARS-CoV-2 lineage B.1.1.7 emergence. Science 373, 889–895 (2021).
Article ADS CAS PubMed Google Scholar
Park, S. W. et al Roles of generation-interval distributions in shaping relative epidemic strength, speed, and control of new SARS-CoV-2 variants. Preprint at medRxiv https://doi.org/10.1101/2021.05.03.21256545 (2021).
Starr, T. N. et al. Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding. Cell 182, 1295–1310 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zahradník, J., Marciano, S., Shemesh, M. et al. SARS-CoV-2 variant prediction and antiviral drug design are enabled by RBD in vitro evolution. Nat. Microbiol. 6, 1188–1198 (2021).
Brown, J. C. et al. Increased transmission of SARS-CoV-2 lineage B.1.1.7 (VOC 2020212/01) is not accounted for by a replicative advantage in primary airway cells or antibody escape. Preprint at bioRxiv https://doi.org/10.1101/2021.02.24.432576 (2021).
Vöhringer, H. et al. Lineage-specific Growth of SARS-CoV-2 B.1.1.7 During the English National Lockdown (Virological, 2020); https://virological.org/t/lineage-specific-growth-of-sars-cov-2-b-1-1-7-during-the-english-national-lockdown/575/2
The Health Protection (Coronavirus, Restrictions) (All Tiers) (England) Regulations 2020. Wikipedia https://en.wikipedia.org/w/index.php?title=The_Health_Protection_(Coronavirus,_Restrictions)_(All_Tiers)_(England)_Regulations_2020&oldid=1014831173 (2021).
Steel, K. & Davies, B. Coronavirus (COVID-19) Infection Survey, Antibody and Vaccination Data for the UK (ONS, 2021); https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/articles/coronaviruscovid19infectionsurveyantibodydatafortheuk/28april2021
Greaney, A. J. et al. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host & Microbe 29, 463-476 (2021).
Greaney, A. J. et al. Complete mapping of mutations to the SARS-CoV-2 spike receptor-binding domain that escape antibody recognition. Cell Host Microbe 29, 44–57 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhou, D. et al. Evidence of escape of SARS-CoV-2 variant B.1.351 from natural and vaccine-induced sera. Cell 184, 2348–2361 (2021).
Article CAS PubMed PubMed Central Google Scholar
Planas, D. et al. Sensitivity of infectious SARS-CoV-2 B.1.1.7 and B.1.351 variants to neutralizing antibodies. Nat. Med. 27, 917–924 (2021).
Article CAS PubMed Google Scholar
Peacock, T. P. et al. The SARS-CoV-2 variants associated with infections in India, B.1.617, show enhanced spike cleavage by furin. Preprint at bioRxiv https://doi.org/10.1101/2021.05.28.446163 (2021).
Campbell, F. et al. Increased transmissibility and global spread of SARS-CoV-2 variants of concern as at June 2021. Euro Surveill. 26, 2100509 (2021).
Article CAS PubMed Central Google Scholar
Investigation of Novel SARS-CoV-2 Variants of Concern Technical briefing 10 (Public Health England, 2021); https://www.gov.uk/government/publications/investigation-of-novel-sars-cov-2-variant-variant-of-concern-20201201
Li, B. et al. Viral infection and transmission in a large well-traced outbreak caused by the Delta SARS-CoV-2 variant. Preprint at medRxiv https://doi.org/10.1101/2021.07.07.21260122 (2021).
Nasreen, S. et al. Effectiveness of COVID-19 vaccines against variants of concern in Ontario, Canada. Preprint at medRxiv https://doi.org/10.1101/2021.06.28.21259420 (2021).
Lopez Bernal, J. et al. Effectiveness of Covid-19 vaccines against the B.1.617.2 (Delta) variant. N. Engl. J. Med. 385, 585–594 (2021).
Article PubMed Google Scholar
Investigation of Novel SARS-CoV-2 Variants of Concern Technical briefing 19 (Public Health England, 2021); https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/1005517/Technical_Briefing_19.pdf
Ferreira, I. et al. SARS-CoV-2 B.1.617 emergence and sensitivity to vaccine-elicited antibodies. Preprint at bioRxiv https://doi.org/10.1101/2021.05.08.443253 (2021).
Wall, E. C. et al. Neutralising antibody activity against SARS-CoV-2 VOCs B.1.617.2 and B.1.351 by BNT162b2 vaccination. Lancet 397, 2331–2333 (2021).
Article CAS PubMed PubMed Central Google Scholar
Steel, K. & Haughton, P. Coronavirus (COVID-19) Infection Survey, Antibody and Vaccination Data, UK (ONS, 2021); https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/bulletins/coronaviruscovid19infectionsurveyantibodyandvaccinationdatafortheuk/21july2021
Anderson, R. et al. Reproduction Number (R) and Growth Rate (r) of the COVID-19 Epidemic in the UK: Methods of Estimation, Data Sources, Causes of Heterogeneity, and use as a Guide in Policy Formulation (The Royal Society, 2020).
Britton, T., Ball, F. & Trapman, P. A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. Science 369, 846–849 (2020).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Funk, S. et al. Combining serological and contact data to derive target immunity levels for achieving and maintaining measles elimination. BMC Med. 17, 180 (2019).
Article PubMed PubMed Central Google Scholar
Hodgson, D., Flasche, S., Jit, M., Kucharski, A. J. & CMMID COVID-19 Working Group The potential for vaccination-induced herd immunity against the SARS-CoV-2 B.1.1.7 variant. Euro Surveill. 26, 2100428 (2021).
Article CAS PubMed Central Google Scholar
van Dorp, L., Houldcroft, C. J., Richard, D. & Balloux, F. COVID-19, the first pandemic in the post-genomic era. Curr. Opin. Virol. 50, 40–48 (2021).
Article PubMed PubMed Central Google Scholar
The Health Protection (Coronavirus, Restrictions) (England) (No. 4) Regulations 2020. Wikipedia https://en.wikipedia.org/w/index.php?title=The_Health_Protection_(Coronavirus,_Restrictions)_(England)_(No._4)_Regulations_2020&oldid=1014701607 (2021).
Bi, Q. et al. Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. Lancet Infect. Dis. 20, 911–919 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wallinga, J. & Lipsitch, M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proc. Biol. Sci. 274, 599–604 (2007).
CAS PubMed Google Scholar
Cori, A., Ferguson, N. M., Fraser, C. & Cauchemez, S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am. J. Epidemiol. 178, 1505–1512 (2013).
Article PubMed Google Scholar
Bingham, E. et al. Pyro: deep universal probabilistic programming. Preprint at http://arxiv.org/abs/1810.09538 (2018).
Phan, D., Pradhan, N. & Jankowiak, M. Composable effects for flexible and accelerated probabilistic programming in NumPyro. Preprint at http://arxiv.org/abs/1912.11554 (2019).
Hoffman, M. D., Blei, D. M., Wang, C. & Paisley, J. Stochastic variational inference. J. Mach. Learn. Res. 14, 1303–1347 (2013).
MathSciNet MATH Google Scholar
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS ONE 5, e9490 (2010).
Article ADS PubMed PubMed Central Google Scholar
Tavaré, S. Some probabilistic and statistical problems in the analysis of DNA sequences. Lect. Math. Life Sci. 17, 57–86 (1986).
MathSciNet MATH Google Scholar
Lemey, P., Rambaut, A., Drummond, A. J. & Suchard, M. A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. 5, e1000520 (2009).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
De Maio, N., Wu, C.-H., O’Reilly, K. M. & Wilson, D. New routes to phylogeography: a bayesian structured coalescent approximation. PLoS Genet. 11, e1005421 (2015).
Article PubMed PubMed Central Google Scholar
Turakhia, Y. et al. Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic. Nat. Genet. 53, 809–816 (2021).
Article CAS PubMed Google Scholar
Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 4, vex042 (2018).
Article PubMed PubMed Central Google Scholar
Gill, M. S. et al. Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol. Biol. Evol. 30, 713–724 (2013).
Article CAS PubMed Google Scholar
Sherratt, K. et al. Exploring surveillance data biases when estimating the reproduction number: with insights into subpopulation transmission of COVID-19 in England. Philos. Trans. Royal Soc. B 376 https://doi.org/10.1098/RSTB.2020.0283 (2021).
Abbott, S. et al. Estimating the time-varying reproduction number of SARS-CoV-2 using national and subnational case counts. Wellcome Open Res. 5, 112 (2020).
Article Google Scholar
Hellewell, J. et al. Estimating the effectiveness of routine asymptomatic PCR testing at different frequencies for the detection of SARS-CoV-2 infections. BMC Med. 19, 106 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hart, W. S. et al. Inference of SARS-CoV-2 generation times using UK household data. Preprint at medRxiv https://doi.org/10.1101/2021.05.27.21257936 (2021).
Pouwels, K. B. et al. Community prevalence of SARS-CoV-2 in England from April to November, 2020: results from the ONS Coronavirus Infection Survey. Lancet Publ. Health 6, e30–e38 (2021).
Article Google Scholar
Donnarumma, K. S. Coronavirus (COVID-19) Infection Survey, UK (ONS, 2021); https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/bulletins/coronaviruscovid19infectionsurveypilot/23april2021

Download references

Acknowledgements

We thank E. Allara (Cambridge) and G. Whitton (Sanger) for providing outer postcodes to LTLA mappings; R. Beale for comments and J. McCrone for setting up Thorney Beast analysis; all of the contributors who submitted genome sequences to GISAID (acknowledgement tables for individual sequences are provided at GitHub; https://github.com/NicolaDM/phylogeographySARS-CoV-2); and our colleagues at EMBL-EBI, the Wellcome Sanger Institute and COG-UK for discussions and comments on this manuscript. COG-UK is supported by funding from the Medical Research Council (MRC), part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) and Genome Research Limited, operating as the Wellcome Sanger Institute. Additional sequence generation was funded by the Department of Health and Social Care. H.S.V., J.P.G. and M.G. are supported by a grant from the Department of Health and Social Care. A.W.J., E.B. and M.G. are beneficiaries from grant NNF17OC0027594 from the Novo Nordisk Foundation. E.V. is supported by Wellcome Trust grant 220885/Z/20/Z. T.S. is supported by grant 210918/Z/18/Z, and J.H. and S.F. by grant 210758/Z/18/Z from the Wellcome Trust. H.S.V., N.D.M., A.W.J., N.G., E.B. and M.G. are supported by EMBL.

Funding

Open access funding provided by Deutsches Krebsforschungszentrum (DKFZ).

Author information

Authors and Affiliations

European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Hinxton, UK
Harald S. Vöhringer, Nicola De Maio, Alexander W. Jung, Nick Goldman, Ewan Birney & Moritz Gerstung
Wellcome Sanger Institute, Hinxton, UK
Theo Sanderson, Matthew Sinnott, Thuy Nguyen, Richard Goater, Frank Schwach, Cristina V. Ariani, Sonia Gonçalves, David K. Jackson, Ian Johnston, Callum Saint, John Sillitoe, Maria Suciu, Irina Abnizova, Louise Aigrain, Alex Alderton, Mozam Ali, Laura Allen, Roberto Amato, Ralph Anderson, Cristina Ariani, Siobhan Austin-Guest, Sendu Bala, Jeffrey Barrett, Andrew Bassett, Kristina Battleday, James Beal, Mathew Beale, Charlotte Beaver, Sam Bellany, Tristram Bellerby, Katie Bellis, Duncan Berger, Matt Berriman, Emma Betteridge, Paul Bevan, Simon Binley, Jason Bishop, Kirsty Blackburn, James Bonfield, Nick Boughton, Sam Bowker, Timothy Brendler-Spaeth, Iraad Bronner, Tanya Brooklyn, Sarah Kay Buddenborg, Robert Bush, Catarina Caetano, Alex Cagan, Nicola Carter, Joanna Cartwright, Tiago Carvalho Monteiro, Liz Chapman, Tracey-Jane Chillingworth, Peter Clapham, Richard Clark, Adrian Clarke, Catriona Clarke, Daryl Cole, Elizabeth Cook, Maria Coppola, Linda Cornell, Clare Cornwell, Craig Corton, Abby Crackett, Alison Cranage, Harriet Craven, Sarah Craw, Mark Crawford, Tim Cutts, Monika Dabrowska, Matt Davies, Robert Davies, Joseph Dawson, Callum Day, Aiden Densem, Thomas Dibling, Cat Dockree, David Dodd, Sunil Dogga, Matthew Dorman, Gordon Dougan, Martin Dougherty, Alexander Dove, Lucy Drummond, Eleanor Drury, Monika Dudek, Jillian Durham, Laura Durrant, Elizabeth Easthope, Sabine Eckert, Pete Ellis, Ben Farr, Michael Fenton, Marcella Ferrero, Neil Flack, Howerd Fordham, Grace Forsythe, Luke Foulser, Matt Francis, Audrey Fraser, Adam Freeman, Anastasia Galvin, Maria Garcia-Casado, Alex Gedny, Sophia Girgis, James Glover, Sonia Goncalves, Scott Goodwin, Oliver Gould, Marina Gourtovaia, Andy Gray, Emma Gray, Coline Griffiths, Yong Gu, Florence Guerin, Will Hamilton, Hannah Hanks, Ewan Harrison, Alexandria Harrott, Edward Harry, Julia Harvison, Paul Heath, Anastasia Hernandez-Koutoucheva, Rhiannon Hobbs, Dave Holland, Sarah Holmes, Gary Hornett, Nicholas Hough, Liz Huckle, Lena Hughes-Hallet, Adam Hunter, Stephen Inglis, Sameena Iqbal, Adam Jackson, David Jackson, Keith James, Dorota Jamrozy, Carlos Jimenez Verdejo, Matthew Jones, Kalyan Kallepally, Leanne Kane, Keely Kay, Sally Kay, Jon Keatley, Alan Keith, Alison King, Lucy Kitchin, Matt Kleanthous, Martina Klimekova, Petra Korlevic, Ksenia Krasheninnkova, Greg Lane, Cordelia Langford, Adam Laverack, Katharine Law, Mara Lawniczak, Stefanie Lensing, Steven Leonard, Laura Letchford, Kevin Lewis, Amanah Lewis-Wade, Jennifer Liddle, Quan Lin, Sarah Lindsay, Sally Linsdell, Rich Livett, Stephanie Lo, Rhona Long, Jamie Lovell, Jon Lovell, Catherine Ludden, James Mack, Mark Maddison, Aleksei Makunin, Irfan Mamun, Jenny Mansfield, Neil Marriott, Matt Martin, Matthew Mayho, Shane McCarthy, Jo McClintock, Samantha McGuigan, Sandra McHugh, Liz McMinn, Carl Meadows, Emily Mobley, Robin Moll, Maria Morra, Leanne Morrow, Kathryn Murie, Sian Nash, Claire Nathwani, Plamena Naydenova, Alexandra Neaverson, Rachel Nelson, Ed Nerou, Jon Nicholson, Tabea Nimz, Guillaume G. Noell, Sarah O’Meara, Valeriu Ohan, Karen Oliver, Charles Olney, Doug Ormond, Agnes Oszlanczi, Steve Palmer, Yoke Fei Pang, Barbora Pardubska, Naomi Park, Aaron Parmar, Gaurang Patel, Minal Patel, Maggie Payne, Arabella Petersen, Deborah Plowman, Tom Preston, Liam Prestwood, Christoph Puethe, Michael Quail, Diana Rajan, Shavanthi Rajatileka, Richard Rance, Suzannah Rawlings, Nicholas Redshaw, Joe Reynolds, Mark Reynolds, Simon Rice, Matt Richardson, Connor Roberts, Katrina Robinson, Melanie Robinson, David Robinson, Hazel Rogers, Eduardo Martin Rojo, Daljit Roopra, Mark Rose, Luke Rudd, Nicholas Salmon, David Saul, Frank Schwach, Carol Scott, Phil Seekings, Lesley Shirley, Alison Simms, Matthew Sinnott, Shanthi Sivadasan, Bart Siwek, Dale Sizer, Kenneth Skeldon, Jason Skelton, Joanna Slater-Tunstill, Lisa Sloper, Nathalie Smerdon, Chris Smith, Christen Smith, James Smith, Katie Smith, Michelle Smith, Sean Smith, Tina Smith, Leighton Sneade, Carmen Diaz Soria, Emily Souster, Andrew Sparkes, Michael Spencer-Chapman, Janet Squares, Robert Stanley, Claire Steed, Tim Stickland, Ian Still, Michael R. Stratton, Michelle Strickland, Allen Swann, Agnieszka Swiatkowska, Neil Sycamore, Emma Swift, Edward Symons, Suzanne Szluha, Emma Taluy, Nunu Tao, Katy Taylor, Sam Taylor, Stacey Thompson, Mark Thompson, Mark Thomson, Nicholas Thomson, Scott Thurston, Gerry Tonkin-Hill, Dee Toombs, Benjamin Topping, Jaime Tovar-Corona, Daniel Ungureanu, James Uphill, Jana Urbanova, Philip Jansen Van Vuuren, Valerie Vancollie, Paul Voak, Danielle Walker, Matthew Walker, Matt Waller, Gary Ward, Charlie Weatherhogg, Niki Webb, Danni Weldon, Alan Wells, Eloise Wells, Luke Westwood, Theo Whipp, Thomas Whiteley, Georgia Whitton, Andrew Whitwham, Sara Widaa, Mia Williams, Mark Wilson, Sean Wright, Ewan M. Harrison, Sónia Gonçalves, David M. Aanensen, Dinesh Aggarwal, Anthony P. Underwood, Cordelia F. Langford, Dominic Kwiatkowski, Andrew R. Bassett, Michael H. Spencer Chapman, Theo Sanderson, Naomi R. Park, Robert M. Davies, Katherine L. Bellis, Matthew J. Dorman, Robin J. Moll, Jaime M. Tovar-Corona, Khalil Abudahab, Ben E. W. Taylor, Jon-Paul Keatley, Iraad F. Bronner, Ben W. Farr, Stefanie V. Lensing, Shane A. McCarthy, Michael A. Quail, Nicholas M. Redshaw, Scott A. J. Thurston, Jennifier Liddle, Dominic Kwiatkowski, Inigo Martincorena & Jeffrey C. Barrett
The Francis Crick Institute, London, UK
Theo Sanderson
Public Health England, London, UK
Frank Schwach, Ian Harrison, Catherine Ludden, Dinesh Aggarwal, Husam Osman, Meera Chand, Sharon J. Peacock, Peter Muir, Esther Robinson, Barry B. Vipond, Andrew Bosworth, Stephanie Hutchings, Hannah M. Pymont, Richard Myers, Alicia Thornton, Katerina Galai, Shazaad S. Y. Ahmad, Nicholas W. Machin, Richard Hopes, Chloe Bishop, Vicki Chalker, Elias Allara, Clare Pearson, David Bibby, Gavin Dabrera, Nicholas Ellaby, Eileen Gallagher, Jonathan Hubb, Angie Lackenby, David Lee, Nikos Manesis, Tamyo Mbisa, Steven Platt, Katherine A. Twohig & Meera Chand
London School of Hygiene & Tropical Medicine, London, UK
Joel Hellewell & Sebastian Funk
The Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
Jasmina Panovska-Griffiths, Tanya Golubchik, David Bonsall, Katrina Lythgoe, Helen Fryer, Christophe Fraser & Laura Thomson
MRC Centre for Global Infectious Disease Analysis, Jameel Institute for Disease and Emergency Analytics, Imperial College London, London, UK
Erik Volz, Rob Johnson, Olivia Boyd, Lily Geidelberg, Manon Ragonnet-Cronin & Erik Volz
Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Perminder Smith, Adela Alcolea, Natasha Ohemeng-Kumi, John Ramble, Jasveen Sehmi & Meera Chand
Division for AI in Oncology, German Cancer Research Centre DKFZ, Heidelberg, Germany
Moritz Gerstung
Department of Medicine, University of Cambridge, Cambridge, UK
Sharon Peacock, Ramin Sadri, Catarina Sousa, Catherine Ludden, M. Estee Torok, Dinesh Aggarwal, Alessandro M. Carabelli, Sharon J. Peacock, Beth Blane, MacGregor Cox, Patrick Maxwell, Ken Smith, Ellena Brooks, Carol M. Churcher, Mireille Fragakis, Katerina Galai, Andrew Jermy, Sarah Judges, Georgina M. McManus, Kim S. Smith, Elaine Westwick & Theresa Feltwell
Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, UK
Samuel C. Robson, Angela H. Beckett & Ekaterina Shelest
School of Pharmacy & Biomedical Sciences, University of Portsmouth, Portsmouth, UK
Samuel C. Robson, Katie F. Loveson, Kate F. Cook, Christopher Fearn & Salman Goudarzi
Cardiff University, Cardiff, UK
Thomas R. Connor, Anna Price, Andrew Mack, Arthur Morriss, Thomas Whalley, Joel Southgate, Martyn Guest, Christine Kitchen, Ian Merrick, Robert Munn, Owen Jones, Catherine Bresner, William Fuller, Angela Marchbank & Trudy Workman
Public Health Wales, Cardiff, UK
Thomas R. Connor, Sally Corden, Catherine Moore, Joanne Watkins, Matthew Bull, Catryn Williams, Nicole Pacchiarini, Simon Cottrell, Sara Rey, David Heyburn, Chris Williams, Noel Craine, Malorie Perry, Alexander Adams, Tara Annett, Hibo Asad, Alec Birchley, Jason Coombes, Johnathan M. Evans, Laia Fina, Bree Gatica-Wilcox, Lauren Gilbert, Lee Graham, Jessica Hey, Ember Hilvers, Sophie Jones, Hannah Jones, Sara Kumziene-Summerhayes, Caoimhe McKerr, Jessica Powell, Georgia Pugh, Sarah Taylor, Joel Southgate, Alisha Davies, Elen De Lacy, Fatima Downing, Sue Edwards, Laura Gifford, Mari Morgan & Amy Gaskin
Institute of Microbiology and Infection, University of Birmingham, Birmingham, UK
Nicholas J. Loman, Sam Nicholls, Claire McMurray, Alan McNally, Joshua Quick, Radoslaw Poplawski, Joanne Stockton & Will Rowe
King’s College London, London, UK
Rocio T. Martinez Nunez & Themoula Charalampous
University of Edinburgh, Edinburgh, UK
Andrew Rambaut, Kate E. Templeton, Áine O’Toole, Thomas Williams, Rachel Colquhoun, Verity Hill, Ben Jackson, J. T. McCrone, Nathan Medd, Emily Scher, Carlos E. Balcazar, Michael D. Gallagher, Daniel Maloney, Thomas D. Stanton & Kathleen A. Williamson
Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Luke B. Snell, Gaia Nebbia, Jonathan Edgeworth, Rahul Batra, Bindi Patel, Themoula Charalampous & Amita Patel
University College London Hospitals NHS Foundation Trust, London, UK
Eleni Nastouli, Catherine Houlihan, Matthew Byott, Sunando Roy, Judith Heaney, Judith Breuer, Moira J. Spyer, Rachel J. Williams, Charlotte A. Williams, John A. Hartley, Leah Ensell, Helen L. Lowe, Laurentiu Maftei, Matteo Mondani, Nadua Bayzid & Marius Cotic
Advanced Pathogen Diagnostics Unit, University College London Hospital, London, UK
Eleni Nastouli, Dan Frampton, Matthew Byott, Judith Heaney & Moira J. Spyer
Great Ormond Street Institute of Child Health, University College London, London, UK
Eleni Nastouli & Moira J. Spyer
Department of Infectious Diseases and Microbiology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
M. Estee Torok & William L. Hamilton
Division of Virology, Department of Pathology, University of Cambridge, Cambridge, UK
Ian G. Goodfellow, Yasmin Chaudhry, Rhys Izuagbe, Aminu S. Jahun, Iliana Georgana, Myra Hosmillo & Malte L. Pinckert
University Hospital Southampton NHS Foundation Trust, Southampton, UK
Jacqui A. Prieto, Kordo Saeed, Stephen Aplin, Matthew Harvey, Thea Sass, Helen Umpleby, Helen Wheeler, Emanuela Pelosi, Siona Silviera, Eleri Wilson-Davies, Adhyana I. K. Mahanama, Buddhini Samaraweera & Sarah Jeremiah
School of Health Sciences, University of Southampton, Southampton, UK
Jacqui A. Prieto
School of Medicine, University of Southampton, Southampton, UK
Kordo Saeed
Division of Infection and Immunity, University College London, London, UK
Catherine Houlihan & Dan Frampton
University of Brighton, Brighton, UK
Giselda Bucca, Colin P. Smith & Andrew R. Hesketh
Institute for Infection and Immunity, St George’s University of London, London, UK
Adam A. Witney, Cassie F. Pope, Kenneth G. Laing & Irene M. Monahan
Infection Care Group, St George’s University Hospitals NHS Foundation Trust, London, UK
Cassie F. Pope
MRC–University of Glasgow Centre for Virus Research, Glasgow, UK
Emma C. Thomson, James G. Shepherd, David L. Robertson, William T. Harvey, Joseph Hughes, Richard J. Orton, Ana da Silva Filipe, Igor Starinskij, Sreenu Vattipally, Afrida Mukaddas, Derek W. Wright, Alice Broos, Daniel Mair, Jenna Nichols, Kyriaki Nomikou, Lily Tong & Ioulia Tsatsani
University of Cambridge, Cambridge, UK
Ewan M. Harrison, Ravi K. Gupta, Michael H. Spencer Chapman, Chris Ruis, Sophia T. Girgis, Katherine L. Bellis, Leanne M. Kermack, Claire Cormie, Joana Dias, Sally Forrest, Ellen E. Higginson, Mailis Maes, Jamie Young, Elias Allara, Clare Pearson & Shane A. McCarthy
Queen’s University Belfast, Belfast, UK
Fiona Rogan, Derek J. Fairley, David A. Simpson, Marc Fuchs, Julia Miskelly, Stephen Bridgett, Timofey Skvortsov, Declan T. Bradley, Zoltan Molnar, Chris Baxter, Sílvia F. Carvalho, Deborah Lavin, Arun Mariappan, Clara Radulescu, Aditi Singh & Miao Tang
Blackpool Teaching Hospitals NHS Foundation Trust, Blackpool, UK
Shaun M. Beckwith, Abigail Murray & Dawn Singleton
Hull University Teaching Hospitals NHS Trust, Hull, UK
Kirstine Eastick, Patrick J. Lillie, Phillipa J. Burns, Kavitha Gajee, Katie Kitchman & William Everson
University Hospitals Dorset NHS Foundation Trust, Poole, UK
Liz A. Sheridan, Dorian Crudgington & Ben Macklin
University Hospitals Sussex NHS Foundation Trust, Worthing, UK
Paul Randell, Mohammed O. Hassan-Ibrahim, Michelle J. Erkiert, Nicola J. Chaloner, Benjamin J. Cogger, Lisa J. Easton, Hannah Huckson, Jonathan Lewis, Sarah Lowdon, Cassandra S. Malone, Florence Munemo, Manasa Mutingwende, Roberto Nicodemi, Olga Podplomyk & Thomas Somassa
University of Exeter, Exeter, UK
Leigh M. Jackson, Ben Temperton, Stephen L. Michell, Aaron R. Jeffries, Christopher R. Jones, Bridget A. Knight, Robin Manley, Michelle L. Michelsen, Christine M. Sambles, David J. Studholme & Joanna Warwick-Dugdale
Belfast Health & Social Care Trust, Belfast, UK
Derek J. Fairley, James P. McKenna & Tanya Curran
Deep Seq, School of Life Sciences, Queens Medical Centre, University of Nottingham, Nottingham, UK
Matthew W. Loose, Matthew Carlile, Nadine Holmes, Christopher Moore, Victoria Wright, Johnny Debebe & Fei Sang
East Kent Hospitals University NHS Foundation Trust, Canterbury, UK
Samuel Moses & Hannah Lowe
University of Kent, Canterbury, UK
Samuel Moses
Hub for Biotechnology in the Built Environment, Northumbria University, Newcastle upon Tyne, UK
Darren L. Smith, Matthew Bashton & Gregory R. Young
Northumbria University, Newcastle upon Tyne, UK
Darren L. Smith, Matthew Bashton, Gregory R. Young, Clare M. McCann, Andrew Nelson, Matthew R. Crown, John H. Henderson, Amy Hollis, William Stanley & Wen C. Yew
NU-OMICS, Northumbria University, Newcastle upon Tyne, UK
Darren L. Smith, Clare M. McCann & Andrew Nelson
Centre for Genomic Pathogen Surveillance, University of Oxford, Oxford, UK
David M. Aanensen, Anthony P. Underwood, Khalil Abudahab, Mirko Menegazzo, Ben E. W. Taylor & Corin A. Yeats
Public Health England, Cambridge, Cambridge, UK
Martin D. Curran & Surendra Parmar
University of Sheffield, Sheffield, UK
Matthew D. Parker, Thushan I. de Silva, Dennis Wang, Mehmet Yavus, Samantha E. Hansford, Benjamin H. Foulkes, Marta Gallis, Hailey R. Hornsby, Stavroula F. Louka, Manoj Pohare, Paige Wolverson, Peijun Zhang, Timothy M. Freeman, Sharon N. Hsu, Benjamin B. Lindsey, Nikki Smith, Cariad Evans, Kate Johnson, David G. Partridge, Mohammad Raza, Alexander J. Keeley, Adrienn Angyal, Luke R. Green & Max Whiteley
Portsmouth Hospitals University NHS Trust, Portsmouth, UK
Sharon Glaysher, Scott Elliott, Kelly Bicknell, Robert Impey & Sarah Wyllie
NHS Lothian, Edinburgh, UK
Kate E. Templeton, Rebecca Dewar, Martin P. McHugh, Elizabeth Wastnedge, Seb Cotton & Abbie Gallagher
NHS Greater Glasgow and Clyde, Glasgow, UK
Rory N. Gunson, Rachel Blacow & Jon Perkins
Quadram Institute Bioscience, Norwich, UK
Justin O’Grady, Andrew J. Page, Gemma L. Kay, Nabil-Fareed Alikhan, Alexander J. Trotter, Leonardo de Oliveira Martins, Alison E. Mather, Lizzie Meadows, Alp Aydin, David J. Baker, Ebenezer Foster-Nyarko, Sophie J. Prosolek, Steven Rudder & Thanh Le-Viet
University of East Anglia, Norwich, UK
Justin O’Grady & Rose K. Davidson
University of Oxford, Oxford, UK
Dominic Kwiatkowski
Hampshire Hospitals NHS Foundation Trust, Basingstoke, UK
Nicholas Cortes, Nathan Moore, Claire Thomas, Stephen P. Kidd, Beatrice Bertolusso, Jessica Lynch, Gabrielle Vernet & Rebecca Williams
Royal Free London NHS Foundation Trust, London, UK
Tabitha W. Mahungu, Tanzina Haque, Jennifer Hart, Dianne Irish-Tavares & Eric Witele
South Tees Hospitals NHS Foundation Trust, Middlesbrough, UK
Steven Liggett, Craig Mower, Louisa K. Watson, Paul Baker, Stephen Bonner, Sarah Essex & Leanne J. Murray
School of Biological Sciences, University of Portsmouth, Portsmouth, UK
Angela H. Beckett, Yann Bourgeois & Ekaterina Shelest
Public Health Scotland, Edinburgh, UK
Matthew T. G. Holden, Sharif Shaaban, Stefan Rooke & Gonzalo Yebra
Health Services Laboratories, London, UK
Lisa J. Levett, Judith Heaney, Paul R. Grant, Stuart Kirk, Wendy Chatterton & Monika Pusok
Heartlands Hospital, Birmingham, UK
Husam Osman, Esther Robinson, Li Xu-McCrae & Andrew Bosworth
University of Liverpool, Liverpool, UK
Alistair C. Darby, Steve Paterson, Miren Iturriza-Gomara, Anita O. Lucaci, Sam T. Haldenby, Kathryn A. Jackson, Lance Turtle, Richard Eccles, Matthew Gemmell, Richard Gregory, Margaret Hughes, Charlotte Nelson, Lucille Rainbow, Edith E. Vamos, Hermione J. Webster, Mark Whitehead & Claudia Wierzbicki
Department of Zoology, University of Oxford, Oxford, UK
Oliver G. Pybus, Louis du Plessis, Moritz U. G. Kraemer, Jayna Raghwani, Alex E. Zarebski, Stephen W. Attwood, Marina Escalera Zamudio, Sarah Francois, Bernardo Gutierrez & Tetyana I. Vasylyeva
MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
Daniela de Angelis, Chris J. Illingworth, Chris Jackson & David Pascall
Microbiology Department, Buckinghamshire Healthcare NHS Trust, Aylesbury, UK
Nick Wong, David T. Pritchard, Debbie Binns & Victoria James
The Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
Yusri Taha, Jennifer Collins, Gary Eltringham, Shirelle Burton-Fanning, Brendan A. I. Payne & Sheila Waugh
University of St Andrews, St Andrews, UK
Martin P. McHugh
Imperial College Healthcare NHS Trust, London, UK
Siddharth Mookerjee, Alison Cox, Pinglawathee Madona, Marcus Pond, Paul A. Randell, Alison H. Holmes, James R. Price & Frances Bolt
NIHR Health Protection Research Unit in HCAI and AMR, Imperial College London, London, UK
Siddharth Mookerjee, Alison H. Holmes, James R. Price & Frances Bolt
Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
Ben Warne
Department of Microbiology, South West London Pathology, London, UK
Joshua F. Taylor & Ngee Keong Tan
Betsi Cadwaladr University Health Board, Wrexham, UK
Helen Adams
Institute of Biodiversity, Animal Health & Comparative Medicine, Glasgow, UK
William T. Harvey & Katherine L. Smollett
Norfolk County Council, Norfolk, UK
Lewis G. Spurgin & Louise Smith
Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
Judith Breuer, Julianne R. Brown, Kathryn A. Harris, Nathaniel Storey, Laura Atkinson, Jack C. D. Lee & Divya Shah
Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
John A. Todd, David Buck, Angie Green, George MacIntyre-Cockett & Amy Trebes
Cardiff and Vale University Health Board, Cardiff, UK
Michaela John, Sian Morgan, Safiah Afifi, Robert Beer, Joshua Maksimovic, Kathryn McCluggage Masters & Karla Spellman
Turnkey Laboratory, University of Birmingham, Birmingham, UK
Alan McNally, Fiona Ashford, Angus Best, Liam Crawford, Nicola Cumley, Megan Mayhew, Oliver Megram, Jeremy Mirza, Emma Moles-Garcia & Benita Percival
Norfolk and Norwich University Hospitals NHS Foundation Trust, Norwich, UK
Samir Dervisevic, Rachael Stanley & Lindsay Coupland
Royal Brompton and Harefield Hospitals, London, UK
Newara A. Ramadan
The Queen Elizabeth Hospital King’s Lynn NHS Foundation Trust, King’s Lynn, UK
Christopher Jeanes
Whittington Health NHS Trust, London, UK
Jana Catalan & Neil Jones
Barking, Havering and Redbridge University Hospitals NHS Trust, London, UK
Amy Ash & Cherian Koshy
Bournemouth University, Bournemouth, UK
Magdalena Barrow, Sarah L. Buchan & Anna Mantzouratou
Clinical Microbiology Department, Queens Medical Centre, Nottingham University Hospitals NHS Trust, Nottingham, UK
Gemma Clark, Louise Berry, Tim Boswell, Vicki M. Fleming, Hannah C. Howson-Wells, Amelia Joseph, Manjinder Khakh & Michelle M. Lister
Clinical Microbiology, University Hospitals of Leicester NHS Trust, Leicester, UK
Christopher W. Holmes, Paul W. Bird, Karlie Fallon, Thomas Helmer, Claire L. McMurray, Mina Odedra, Jessica Shaw, Julian W. Tang & Nicholas J. Willford
County Durham and Darlington NHS Foundation Trust, Durham, UK
Sharon Campbell, Victoria Blakey, Veena Raviprakash, Nicola Sheriff & Lesley-Anne Williams
Department of Microbiology, Kettering General Hospital, Kettering, UK
Thomas Davis, Sahar Eldirdiri & Anita Kenyon
Barts Health NHS Trust, London, UK
Kathryn A. Harris
North West London Pathology, London, UK
Alison Cox, Pinglawathee Madona, Marcus Pond & Paul A. Randell
Maidstone and Tunbridge Wells NHS Trust, Maidstone, UK
Karen T. Withell & Graciela Sluga
Microbiology, Royal Oldham Hospital, Oldham, UK
Cheryl Williams
North Cumbria Integrated Care NHS Foundation Trust, Carlisle, UK
Clive Graham, Edward Barton, Debra Padgett & Garren Scott
North Tees and Hartlepool NHS Foundation Trust, Stockton on Tees, UK
Rebecca Denton-Smith, Emma Swindells, Robyn Turnbull & Jane Greenaway
Path Links, Northern Lincolnshire and Goole NHS Foundation Trust, Grimsby, UK
Tim J. Sloan, Phillip Clarke, Nichola Duckworth & Sarah Walsh
Queen Elizabeth Hospital, Birmingham, UK
Anna Casey & Liz Ratcliffe
Royal Devon and Exeter NHS Foundation Trust, Exeter, UK
Christopher R. Jones, Bridget A. Knight, Jennifer Poyner, Cressida Auckland & Helen Morcrette
Virology, School of Life Sciences, Queens Medical Centre, University of Nottingham, Nottingham, UK
Patrick C. McClure, Jonathan Ball, Timothy Byaruhanga, Joseph G. Chappell, Jayasree Dey, Jack D. Hill & Emily J. Park
Swansea University, Swansea, UK
Ronan A. Lyons
Viapath, Guy’s and St Thomas’ NHS Foundation Trust, and King’s College Hospital NHS Foundation Trust, London, UK
Perminder Smith, Adela Alcolea, Natasha Ohemeng-Kumi, John Ramble & Jasveen Sehmi
Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK
Mehmet Yavus, Cariad Evans, Kate Johnson, David G. Partridge & Mohammad Raza
Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
Nicola Reynolds
West of Scotland Specialist Virology Centre, NHS Greater Glasgow and Clyde, Glasgow, UK
Lynne Ferguson, Emily J. Goldstein, Alasdair Maclean & Rachael Tomb
Public Health Agency, Belfast, UK
Tim Wyatt & Declan T. Bradley
Northumbria Healthcare NHS Foundation Trust, Newcastle upon Tyne, UK
Giles Idle & Kevin Cole
University of Southampton, Southampton, UK
Matilde Mori
East Suffolk and North Essex NHS Foundation Trust, Colchester, UK
Luke Bedford
East Sussex Healthcare NHS Trust, St Leonards-on-Sea, UK
James S. Cargill & Warwick Hughes
Gateshead Health NHS Foundation Trust, Gateshead, UK
Jonathan Moore & Susanne Stonehouse
Isle of Wight NHS Trust, Newport, UK
Anibolina Castigador & Emily Macnaughton
King’s College Hospital NHS Foundation Trust, London, UK
Kate El Bouzidi, Temi Lampejo & Malur Sudhanva
Liverpool Clinical Laboratories, Liverpool, UK
Cassie Breen
Manchester University NHS Foundation Trust, Manchester, UK
Shazaad S. Y. Ahmad, Ryan P. George & Nicholas W. Machin
North Middlesex University Hospital NHS Trust, London, UK
Aidan Cross & Mariyam Mirfenderesky
Southwest Pathology Services, Taunton, UK
Andrew I. Lawton
The Royal Marsden NHS Foundation Trust, London, UK
Andrea N. Gomes, Maimuna Kimuli & Darren R. Murray
The Royal Wolverhampton NHS Trust, Wolverhampton, UK
Paula Ashfield & Donald Dobie
University of Birmingham, Birmingham, UK
Andrew Beggs & Alex Richter
Watford General Hospital, Watford, UK
Arezou Fanaie, Rachel A. Hilson & Geraldine Yaze
Guy’s and St Thomas’ Biomedical Research Centre, London, UK
Flavia Flaviani
Newcastle University, Newcastle, UK
Sarah O’Brien, Steven Rushton & Roy Sanderson

Authors

Harald S. Vöhringer
View author publications
You can also search for this author in PubMed Google Scholar
Theo Sanderson
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Sinnott
View author publications
You can also search for this author in PubMed Google Scholar
Nicola De Maio
View author publications
You can also search for this author in PubMed Google Scholar
Thuy Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Richard Goater
View author publications
You can also search for this author in PubMed Google Scholar
Frank Schwach
View author publications
You can also search for this author in PubMed Google Scholar
Ian Harrison
View author publications
You can also search for this author in PubMed Google Scholar
Joel Hellewell
View author publications
You can also search for this author in PubMed Google Scholar
Cristina V. Ariani
View author publications
You can also search for this author in PubMed Google Scholar
Sonia Gonçalves
View author publications
You can also search for this author in PubMed Google Scholar
David K. Jackson
View author publications
You can also search for this author in PubMed Google Scholar
Ian Johnston
View author publications
You can also search for this author in PubMed Google Scholar
Alexander W. Jung
View author publications
You can also search for this author in PubMed Google Scholar
Callum Saint
View author publications
You can also search for this author in PubMed Google Scholar
John Sillitoe
View author publications
You can also search for this author in PubMed Google Scholar
Maria Suciu
View author publications
You can also search for this author in PubMed Google Scholar
Nick Goldman
View author publications
You can also search for this author in PubMed Google Scholar
Jasmina Panovska-Griffiths
View author publications
You can also search for this author in PubMed Google Scholar
Ewan Birney
View author publications
You can also search for this author in PubMed Google Scholar
Erik Volz
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Funk
View author publications
You can also search for this author in PubMed Google Scholar
Dominic Kwiatkowski
View author publications
You can also search for this author in PubMed Google Scholar
Meera Chand
View author publications
You can also search for this author in PubMed Google Scholar
Inigo Martincorena
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey C. Barrett
View author publications
You can also search for this author in PubMed Google Scholar
Moritz Gerstung
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

The Wellcome Sanger Institute COVID-19 Surveillance Team

Irina Abnizova
, Louise Aigrain
, Alex Alderton
, Mozam Ali
, Laura Allen
, Roberto Amato
, Ralph Anderson
, Cristina Ariani
, Siobhan Austin-Guest
, Sendu Bala
, Jeffrey Barrett
, Andrew Bassett
, Kristina Battleday
, James Beal
, Mathew Beale
, Charlotte Beaver
, Sam Bellany
, Tristram Bellerby
, Katie Bellis
, Duncan Berger
, Matt Berriman
, Emma Betteridge
, Paul Bevan
, Simon Binley
, Jason Bishop
, Kirsty Blackburn
, James Bonfield
, Nick Boughton
, Sam Bowker
, Timothy Brendler-Spaeth
, Iraad Bronner
, Tanya Brooklyn
, Sarah Kay Buddenborg
, Robert Bush
, Catarina Caetano
, Alex Cagan
, Nicola Carter
, Joanna Cartwright
, Tiago Carvalho Monteiro
, Liz Chapman
, Tracey-Jane Chillingworth
, Peter Clapham
, Richard Clark
, Adrian Clarke
, Catriona Clarke
, Daryl Cole
, Elizabeth Cook
, Maria Coppola
, Linda Cornell
, Clare Cornwell
, Craig Corton
, Abby Crackett
, Alison Cranage
, Harriet Craven
, Sarah Craw
, Mark Crawford
, Tim Cutts
, Monika Dabrowska
, Matt Davies
, Robert Davies
, Joseph Dawson
, Callum Day
, Aiden Densem
, Thomas Dibling
, Cat Dockree
, David Dodd
, Sunil Dogga
, Matthew Dorman
, Gordon Dougan
, Martin Dougherty
, Alexander Dove
, Lucy Drummond
, Eleanor Drury
, Monika Dudek
, Jillian Durham
, Laura Durrant
, Elizabeth Easthope
, Sabine Eckert
, Pete Ellis
, Ben Farr
, Michael Fenton
, Marcella Ferrero
, Neil Flack
, Howerd Fordham
, Grace Forsythe
, Luke Foulser
, Matt Francis
, Audrey Fraser
, Adam Freeman
, Anastasia Galvin
, Maria Garcia-Casado
, Alex Gedny
, Sophia Girgis
, James Glover
, Sonia Goncalves
, Scott Goodwin
, Oliver Gould
, Marina Gourtovaia
, Andy Gray
, Emma Gray
, Coline Griffiths
, Yong Gu
, Florence Guerin
, Will Hamilton
, Hannah Hanks
, Ewan Harrison
, Alexandria Harrott
, Edward Harry
, Julia Harvison
, Paul Heath
, Anastasia Hernandez-Koutoucheva
, Rhiannon Hobbs
, Dave Holland
, Sarah Holmes
, Gary Hornett
, Nicholas Hough
, Liz Huckle
, Lena Hughes-Hallet
, Adam Hunter
, Stephen Inglis
, Sameena Iqbal
, Adam Jackson
, David Jackson
, Keith James
, Dorota Jamrozy
, Carlos Jimenez Verdejo
, Ian Johnston
, Matthew Jones
, Kalyan Kallepally
, Leanne Kane
, Keely Kay
, Sally Kay
, Jon Keatley
, Alan Keith
, Alison King
, Lucy Kitchin
, Matt Kleanthous
, Martina Klimekova
, Petra Korlevic
, Ksenia Krasheninnkova
, Dominic Kwiatkowski
, Greg Lane
, Cordelia Langford
, Adam Laverack
, Katharine Law
, Mara Lawniczak
, Stefanie Lensing
, Steven Leonard
, Laura Letchford
, Kevin Lewis
, Amanah Lewis-Wade
, Jennifer Liddle
, Quan Lin
, Sarah Lindsay
, Sally Linsdell
, Rich Livett
, Stephanie Lo
, Rhona Long
, Jamie Lovell
, Jon Lovell
, Catherine Ludden
, James Mack
, Mark Maddison
, Aleksei Makunin
, Irfan Mamun
, Jenny Mansfield
, Neil Marriott
, Matt Martin
, Inigo Martincorena
, Matthew Mayho
, Shane McCarthy
, Jo McClintock
, Samantha McGuigan
, Sandra McHugh
, Liz McMinn
, Carl Meadows
, Emily Mobley
, Robin Moll
, Maria Morra
, Leanne Morrow
, Kathryn Murie
, Sian Nash
, Claire Nathwani
, Plamena Naydenova
, Alexandra Neaverson
, Rachel Nelson
, Ed Nerou
, Jon Nicholson
, Tabea Nimz
, Guillaume G. Noell
, Sarah O’Meara
, Valeriu Ohan
, Karen Oliver
, Charles Olney
, Doug Ormond
, Agnes Oszlanczi
, Steve Palmer
, Yoke Fei Pang
, Barbora Pardubska
, Naomi Park
, Aaron Parmar
, Gaurang Patel
, Minal Patel
, Maggie Payne
, Sharon Peacock
, Arabella Petersen
, Deborah Plowman
, Tom Preston
, Liam Prestwood
, Christoph Puethe
, Michael Quail
, Diana Rajan
, Shavanthi Rajatileka
, Richard Rance
, Suzannah Rawlings
, Nicholas Redshaw
, Joe Reynolds
, Mark Reynolds
, Simon Rice
, Matt Richardson
, Connor Roberts
, Katrina Robinson
, Melanie Robinson
, David Robinson
, Hazel Rogers
, Eduardo Martin Rojo
, Daljit Roopra
, Mark Rose
, Luke Rudd
, Ramin Sadri
, Nicholas Salmon
, David Saul
, Frank Schwach
, Carol Scott
, Phil Seekings
, Lesley Shirley
, John Sillitoe
, Alison Simms
, Matthew Sinnott
, Shanthi Sivadasan
, Bart Siwek
, Dale Sizer
, Kenneth Skeldon
, Jason Skelton
, Joanna Slater-Tunstill
, Lisa Sloper
, Nathalie Smerdon
, Chris Smith
, Christen Smith
, James Smith
, Katie Smith
, Michelle Smith
, Sean Smith
, Tina Smith
, Leighton Sneade
, Carmen Diaz Soria
, Catarina Sousa
, Emily Souster
, Andrew Sparkes
, Michael Spencer-Chapman
, Janet Squares
, Robert Stanley
, Claire Steed
, Tim Stickland
, Ian Still
, Michael R. Stratton
, Michelle Strickland
, Allen Swann
, Agnieszka Swiatkowska
, Neil Sycamore
, Emma Swift
, Edward Symons
, Suzanne Szluha
, Emma Taluy
, Nunu Tao
, Katy Taylor
, Sam Taylor
, Stacey Thompson
, Mark Thompson
, Mark Thomson
, Nicholas Thomson
, Scott Thurston
, Gerry Tonkin-Hill
, Dee Toombs
, Benjamin Topping
, Jaime Tovar-Corona
, Daniel Ungureanu
, James Uphill
, Jana Urbanova
, Philip Jansen Van Vuuren
, Valerie Vancollie
, Paul Voak
, Danielle Walker
, Matthew Walker
, Matt Waller
, Gary Ward
, Charlie Weatherhogg
, Niki Webb
, Danni Weldon
, Alan Wells
, Eloise Wells
, Luke Westwood
, Theo Whipp
, Thomas Whiteley
, Georgia Whitton
, Andrew Whitwham
, Sara Widaa
, Mia Williams
, Mark Wilson
& Sean Wright

The COVID-19 Genomics UK (COG-UK) Consortium*

Funding acquisition, leadership and supervision, metadata curation, project administration, samples and logistics, sequencing and analysis, software and analysis tools, and visualization
- Samuel C. Robson
Funding acquisition, leadership and supervision, metadata curation, project administration, samples and logistics, sequencing and analysis, and software and analysis tools
- Thomas R. Connor
- & Nicholas J. Loman
Leadership and supervision, metadata curation, project administration, samples and logistics, sequencing and analysis, software and analysis tools, and visualization
- Tanya Golubchik
Funding acquisition, leadership and supervision, metadata curation, samples and logistics, sequencing and analysis, and visualization
- Rocio T. Martinez Nunez
Funding acquisition, leadership and supervision, project administration, samples and logistics, sequencing and analysis, and software and analysis tools
- David Bonsall
Funding acquisition, leadership and supervision, project administration, sequencing and analysis, software and analysis tools, and visualization
- Andrew Rambaut
Funding acquisition, metadata curation, project administration, samples and logistics, sequencing and analysis, and software and analysis tools
- Luke B. Snell
Leadership and supervision, metadata curation, project administration, samples and logistics, software and analysis tools, and visualization
- Rich Livett
Funding acquisition, leadership and supervision, metadata curation, project administration, and samples and logistics
- Catherine Ludden
Funding acquisition, leadership and supervision, metadata curation, samples and logistics, and sequencing and analysis
- Sally Corden
- & Eleni Nastouli
Funding acquisition, leadership and supervision, metadata curation, sequencing and analysis, and software and analysis tools
- Gaia Nebbia
Funding acquisition, leadership and supervision, project administration, samples and logistics, and sequencing and analysis
- Ian Johnston
Leadership and supervision, metadata curation, project administration, samples and logistics, and sequencing and analysis
- Katrina Lythgoe
- , M. Estee Torok
- & Ian G. Goodfellow
Leadership and supervision, metadata curation, project administration, samples and logistics, and visualization
- Jacqui A. Prieto
- & Kordo Saeed
Leadership and supervision, metadata curation, project administration, sequencing and analysis, and software and analysis tools
- David K. Jackson
Leadership and supervision, metadata curation, samples and logistics, sequencing and analysis, and visualization
- Catherine Houlihan
Leadership and supervision, metadata curation, sequencing and analysis, software and analysis tools, and visualization
- Dan Frampton
Metadata curation, project administration, samples and logistics, sequencing and analysis, and software and analysis tools
- William L. Hamilton
- & Adam A. Witney
Funding acquisition, samples and logistics, sequencing and analysis, and visualization
- Giselda Bucca
Funding acquisition, leadership and supervision, metadata curation and project administration
- Cassie F. Pope
Funding acquisition, leadership and supervision, metadata curation, and samples and logistics
- Catherine Moore
Funding acquisition, leadership and supervision, metadata curation, and sequencing and analysis
- Emma C. Thomson
Funding acquisition, leadership and supervision, project administration, and samples and logistics
- Ewan M. Harrison
Funding acquisition, leadership and supervision, sequencing and analysis, and visualization
- Colin P. Smith
Leadership and supervision, metadata curation, project administration, and sequencing and analysis
- Fiona Rogan
Leadership and supervision, metadata curation, project administration, and samples and logistics
- Shaun M. Beckwith
- , Abigail Murray
- , Dawn Singleton
- , Kirstine Eastick
- , Liz A. Sheridan
- , Paul Randell
- , Leigh M. Jackson
- , Cristina V. Ariani
- & Sónia Gonçalves
Leadership and supervision, metadata curation, samples and logistics, and sequencing and analysis
- Derek J. Fairley
- , Matthew W. Loose
- & Joanne Watkins
Leadership and supervision, metadata curation, samples and logistics, and visualization
- Samuel Moses
Leadership and supervision, metadata curation, sequencing and analysis, and software and analysis tools
- Sam Nicholls
- , Matthew Bull
- & Roberto Amato
Leadership and supervision, project administration, samples and logistics, and sequencing and analysis
- Darren L. Smith
Leadership and supervision, sequencing and analysis, software and analysis tools, and visualization
- David M. Aanensen
- & Jeffrey C. Barrett
Metadata curation, project administration, samples and logistics, and sequencing and analysis
- Dinesh Aggarwal
- , James G. Shepherd
- , Martin D. Curran
- & Surendra Parmar
Metadata curation, project administration, sequencing and analysis, and software and analysis tools
- Matthew D. Parker
Metadata curation, samples and logistics, sequencing and analysis, and software and analysis tools
- Catryn Williams
Metadata curation, samples and logistics, sequencing and analysis, and visualization
- Sharon Glaysher
Metadata curation, sequencing and analysis, software and analysis tools, and visualization
- Anthony P. Underwood
- , Matthew Bashton
- , Nicole Pacchiarini
- , Katie F. Loveson
- & Matthew Byott
Project administration, sequencing and analysis, software and analysis tools, and visualization
- Alessandro M. Carabelli
Funding acquisition, leadership and supervision, and metadata curation
- Kate E. Templeton
Funding acquisition, leadership and supervision, and project administration
- Thushan I. de Silva
- , Dennis Wang
- , Cordelia F. Langford
- & John Sillitoe
Funding acquisition, leadership and supervision, and samples and logistics
- Rory N. Gunson
Funding acquisition, leadership and supervision, and sequencing and analysis
- Simon Cottrell
- , Justin O’Grady
- & Dominic Kwiatkowski
Leadership and supervision, metadata curation and project administration
- Patrick J. Lillie
Leadership and supervision, metadata curation, and samples and logistics
- Nicholas Cortes
- , Nathan Moore
- , Claire Thomas
- , Phillipa J. Burns
- , Tabitha W. Mahungu
- & Steven Liggett
Leadership and supervision, metadata curation, and sequencing and analysis
- Angela H. Beckett
- & Matthew T. G. Holden
Leadership and supervision, project administration, and samples and logistics
- Lisa J. Levett
- , Husam Osman
- & Mohammed O. Hassan-Ibrahim
Leadership and supervision, project administration, and sequencing and analysis
- David A. Simpson
Leadership and supervision, samples and logistics, and sequencing and analysis
- Meera Chand
- , Ravi K. Gupta
- , Alistair C. Darby
- & Steve Paterson
Leadership and supervision, sequencing and analysis, and software and analysis tools
- Oliver G. Pybus
- , Erik Volz
- , Daniela de Angelis
- , David L. Robertson
- , Andrew J. Page
- & Inigo Martincorena
Leadership and supervision, sequencing and analysis, and visualization
- Louise Aigrain
- & Andrew R. Bassett
Metadata curation, project administration, and samples and logistics
- Nick Wong
- , Yusri Taha
- , Michelle J. Erkiert
- & Michael H. Spencer Chapman
Metadata curation, project administration, and sequencing and analysis
- Rebecca Dewar
- & Martin P. McHugh
Metadata curation, project administration, and software and analysis tools
- Siddharth Mookerjee
Metadata curation, project administration and visualization
- Stephen Aplin
- , Matthew Harvey
- , Thea Sass
- , Helen Umpleby
- & Helen Wheeler
Metadata curation, samples and logistics, and sequencing and analysis
- James P. McKenna
- , Ben Warne
- , Joshua F. Taylor
- , Yasmin Chaudhry
- , Rhys Izuagbe
- , Aminu S. Jahun
- , Gregory R. Young
- , Claire McMurray
- , Clare M. McCann
- , Andrew Nelson
- & Scott Elliott
Metadata curation, samples and logistics, and visualization
- Hannah Lowe
Metadata curation, sequencing and analysis, and software and analysis tools
- Anna Price
- , Matthew R. Crown
- , Sara Rey
- , Sunando Roy
- & Ben Temperton
Metadata curation, sequencing and analysis, and visualization
- Sharif Shaaban
- & Andrew R. Hesketh
Project administration, samples and logistics, and sequencing and analysis
- Kenneth G. Laing
- , Irene M. Monahan
- & Judith Heaney
Project administration, samples and logistics, and visualization
- Emanuela Pelosi
- , Siona Silviera
- & Eleri Wilson-Davies
Samples and logistics, software and analysis tools, and visualization
- Helen Fryer
Sequencing and analysis, software and analysis tools, and visualization
- Helen Adams
- , Louis du Plessis
- , Rob Johnson
- , William T. Harvey
- , Joseph Hughes
- , Richard J. Orton
- , Lewis G. Spurgin
- , Yann Bourgeois
- , Chris Ruis
- , Áine O’Toole
- , Marina Gourtovaia
- & Theo Sanderson
Funding acquisition, and leadership and supervision
- Christophe Fraser
- , Jonathan Edgeworth
- , Judith Breuer
- , Stephen L. Michell
- & John A. Todd
Funding acquisition and project administration
- Michaela John
- & David Buck
Leadership and supervision, and metadata curation
- Kavitha Gajee
- & Gemma L. Kay
Leadership and supervision, and project administration
- Sharon J. Peacock
- & David Heyburn
Leadership and supervision, and samples and logistics
- Katie Kitchman
- , Alan McNally
- , David T. Pritchard
- , Samir Dervisevic
- , Peter Muir
- , Esther Robinson
- , Barry B. Vipond
- , Newara A. Ramadan
- , Christopher Jeanes
- , Danni Weldon
- , Jana Catalan
- & Neil Jones
Leadership and supervision, and sequencing and analysis
- Ana da Silva Filipe
- , Chris Williams
- , Marc Fuchs
- , Julia Miskelly
- , Aaron R. Jeffries
- , Karen Oliver
- & Naomi R. Park
Metadata curation, and samples and logistics
- Amy Ash
- , Cherian Koshy
- , Magdalena Barrow
- , Sarah L. Buchan
- , Anna Mantzouratou
- , Gemma Clark
- , Christopher W. Holmes
- , Sharon Campbell
- , Thomas Davis
- , Ngee Keong Tan
- , Julianne R. Brown
- , Kathryn A. Harris
- , Stephen P. Kidd
- , Paul R. Grant
- , Li Xu-McCrae
- , Alison Cox
- , Pinglawathee Madona
- , Marcus Pond
- , Paul A. Randell
- , Karen T. Withell
- , Cheryl Williams
- , Clive Graham
- , Rebecca Denton-Smith
- , Emma Swindells
- , Robyn Turnbull
- , Tim J. Sloan
- , Andrew Bosworth
- , Stephanie Hutchings
- , Hannah M. Pymont
- , Anna Casey
- , Liz Ratcliffe
- , Christopher R. Jones
- , Bridget A. Knight
- , Tanzina Haque
- , Jennifer Hart
- , Dianne Irish-Tavares
- , Eric Witele
- , Craig Mower
- , Louisa K. Watson
- , Jennifer Collins
- , Gary Eltringham
- , Dorian Crudgington
- , Ben Macklin
- , Miren Iturriza-Gomara
- , Anita O. Lucaci
- & Patrick C. McClure
Metadata curation, and sequencing and analysis
- Matthew Carlile
- , Nadine Holmes
- , Christopher Moore
- , Nathaniel Storey
- , Stefan Rooke
- , Gonzalo Yebra
- , Noel Craine
- , Malorie Perry
- , Nabil-Fareed Alikhan
- , Stephen Bridgett
- , Kate F. Cook
- , Christopher Fearn
- , Salman Goudarzi
- , Ronan A. Lyons
- , Thomas Williams
- , Sam T. Haldenby
- , Jillian Durham
- & Steven Leonard
Metadata curation, and software and analysis tools
- Robert M. Davies
Project administration, and samples and logistics
- Rahul Batra
- , Beth Blane
- , Moira J. Spyer
- , Perminder Smith
- , Mehmet Yavus
- , Rachel J. Williams
- , Adhyana I. K. Mahanama
- , Buddhini Samaraweera
- , Sophia T. Girgis
- , Samantha E. Hansford
- , Angie Green
- , Charlotte Beaver
- , Katherine L. Bellis
- , Matthew J. Dorman
- , Sally Kay
- , Liam Prestwood
- & Shavanthi Rajatileka
Project administration, and sequencing and analysis
- Joshua Quick
Project administration, and software and analysis tools
- Radoslaw Poplawski
Samples and logistics, and sequencing and analysis
- Nicola Reynolds
- , Andrew Mack
- , Arthur Morriss
- , Thomas Whalley
- , Bindi Patel
- , Iliana Georgana
- , Myra Hosmillo
- , Malte L. Pinckert
- , Joanne Stockton
- , John H. Henderson
- , Amy Hollis
- , William Stanley
- , Wen C. Yew
- , Richard Myers
- , Alicia Thornton
- , Alexander Adams
- , Tara Annett
- , Hibo Asad
- , Alec Birchley
- , Jason Coombes
- , Johnathan M. Evans
- , Laia Fina
- , Bree Gatica-Wilcox
- , Lauren Gilbert
- , Lee Graham
- , Jessica Hey
- , Ember Hilvers
- , Sophie Jones
- , Hannah Jones
- , Sara Kumziene-Summerhayes
- , Caoimhe McKerr
- , Jessica Powell
- , Georgia Pugh
- , Sarah Taylor
- , Alexander J. Trotter
- , Charlotte A. Williams
- , Leanne M. Kermack
- , Benjamin H. Foulkes
- , Marta Gallis
- , Hailey R. Hornsby
- , Stavroula F. Louka
- , Manoj Pohare
- , Paige Wolverson
- , Peijun Zhang
- , George MacIntyre-Cockett
- , Amy Trebes
- , Robin J. Moll
- , Lynne Ferguson
- , Emily J. Goldstein
- , Alasdair Maclean
- & Rachael Tomb
Samples and logistics, and software and analysis tools
- Igor Starinskij
Sequencing and analysis, and software and analysis tools
- Laura Thomson
- , Joel Southgate
- , Moritz U. G. Kraemer
- , Jayna Raghwani
- , Alex E. Zarebski
- , Olivia Boyd
- , Lily Geidelberg
- , Chris J. Illingworth
- , Chris Jackson
- , David Pascall
- , Sreenu Vattipally
- , Timothy M. Freeman
- , Sharon N. Hsu
- , Benjamin B. Lindsey
- , Keith James
- , Kevin Lewis
- , Gerry Tonkin-Hill
- & Jaime M. Tovar-Corona
Sequencing and analysis, and visualization
- MacGregor Cox
Software and analysis tools, and visualization
- Khalil Abudahab
- , Mirko Menegazzo
- , Ben E. W. Taylor
- , Corin A. Yeats
- , Afrida Mukaddas
- , Derek W. Wright
- , Leonardo de Oliveira Martins
- , Rachel Colquhoun
- , Verity Hill
- , Ben Jackson
- , J. T. McCrone
- , Nathan Medd
- , Emily Scher
- & Jon-Paul Keatley
Leadership and supervision
- Tanya Curran
- , Sian Morgan
- , Patrick Maxwell
- , Ken Smith
- , Sahar Eldirdiri
- , Anita Kenyon
- , Alison H. Holmes
- , James R. Price
- , Tim Wyatt
- , Alison E. Mather
- , Timofey Skvortsov
- & John A. Hartley
Metadata curation
- Martyn Guest
- , Christine Kitchen
- , Ian Merrick
- , Robert Munn
- , Beatrice Bertolusso
- , Jessica Lynch
- , Gabrielle Vernet
- , Stuart Kirk
- , Elizabeth Wastnedge
- , Rachael Stanley
- , Giles Idle
- , Declan T. Bradley
- , Jennifer Poyner
- & Matilde Mori
Project administration
- Owen Jones
- , Victoria Wright
- , Ellena Brooks
- , Carol M. Churcher
- , Mireille Fragakis
- , Katerina Galai
- , Andrew Jermy
- , Sarah Judges
- , Georgina M. McManus
- , Kim S. Smith
- , Elaine Westwick
- , Stephen W. Attwood
- , Frances Bolt
- , Alisha Davies
- , Elen De Lacy
- , Fatima Downing
- , Sue Edwards
- , Lizzie Meadows
- , Sarah Jeremiah
- , Nikki Smith
- & Luke Foulser
Samples and logistics
- Themoula Charalampous
- , Amita Patel
- , Louise Berry
- , Tim Boswell
- , Vicki M. Fleming
- , Hannah C. Howson-Wells
- , Amelia Joseph
- , Manjinder Khakh
- , Michelle M. Lister
- , Paul W. Bird
- , Karlie Fallon
- , Thomas Helmer
- , Claire L. McMurray
- , Mina Odedra
- , Jessica Shaw
- , Julian W. Tang
- , Nicholas J. Willford
- , Victoria Blakey
- , Veena Raviprakash
- , Nicola Sheriff
- , Lesley-Anne Williams
- , Theresa Feltwell
- , Luke Bedford
- , James S. Cargill
- , Warwick Hughes
- , Jonathan Moore
- , Susanne Stonehouse
- , Laura Atkinson
- , Jack C. D. Lee
- , Divya Shah
- , Adela Alcolea
- , Natasha Ohemeng-Kumi
- , John Ramble
- , Jasveen Sehmi
- , Rebecca Williams
- , Wendy Chatterton
- , Monika Pusok
- , William Everson
- , Anibolina Castigador
- , Emily Macnaughton
- , Kate El Bouzidi
- , Temi Lampejo
- , Malur Sudhanva
- , Cassie Breen
- , Graciela Sluga
- , Shazaad S. Y. Ahmad
- , Ryan P. George
- , Nicholas W. Machin
- , Debbie Binns
- , Victoria James
- , Rachel Blacow
- , Lindsay Coupland
- , Louise Smith
- , Edward Barton
- , Debra Padgett
- , Garren Scott
- , Aidan Cross
- , Mariyam Mirfenderesky
- , Jane Greenaway
- , Kevin Cole
- , Phillip Clarke
- , Nichola Duckworth
- , Sarah Walsh
- , Kelly Bicknell
- , Robert Impey
- , Sarah Wyllie
- , Richard Hopes
- , Chloe Bishop
- , Vicki Chalker
- , Ian Harrison
- , Laura Gifford
- , Zoltan Molnar
- , Cressida Auckland
- , Cariad Evans
- , Kate Johnson
- , David G. Partridge
- , Mohammad Raza
- , Paul Baker
- , Stephen Bonner
- , Sarah Essex
- , Leanne J. Murray
- , Andrew I. Lawton
- , Shirelle Burton-Fanning
- , Brendan A. I. Payne
- , Sheila Waugh
- , Andrea N. Gomes
- , Maimuna Kimuli
- , Darren R. Murray
- , Paula Ashfield
- , Donald Dobie
- , Fiona Ashford
- , Angus Best
- , Liam Crawford
- , Nicola Cumley
- , Megan Mayhew
- , Oliver Megram
- , Jeremy Mirza
- , Emma Moles-Garcia
- , Benita Percival
- , Leah Ensell
- , Helen L. Lowe
- , Laurentiu Maftei
- , Matteo Mondani
- , Nicola J. Chaloner
- , Benjamin J. Cogger
- , Lisa J. Easton
- , Hannah Huckson
- , Jonathan Lewis
- , Sarah Lowdon
- , Cassandra S. Malone
- , Florence Munemo
- , Manasa Mutingwende
- , Roberto Nicodemi
- , Olga Podplomyk
- , Thomas Somassa
- , Andrew Beggs
- , Alex Richter
- , Claire Cormie
- , Joana Dias
- , Sally Forrest
- , Ellen E. Higginson
- , Mailis Maes
- , Jamie Young
- , Rose K. Davidson
- , Kathryn A. Jackson
- , Lance Turtle
- , Alexander J. Keeley
- , Jonathan Ball
- , Timothy Byaruhanga
- , Joseph G. Chappell
- , Jayasree Dey
- , Jack D. Hill
- , Emily J. Park
- , Arezou Fanaie
- , Rachel A. Hilson
- , Geraldine Yaze
- & Stephanie Lo
Sequencing and analysis
- Safiah Afifi
- , Robert Beer
- , Joshua Maksimovic
- , Kathryn McCluggage Masters
- , Karla Spellman
- , Catherine Bresner
- , William Fuller
- , Angela Marchbank
- , Trudy Workman
- , Ekaterina Shelest
- , Johnny Debebe
- , Fei Sang
- , Marina Escalera Zamudio
- , Sarah Francois
- , Bernardo Gutierrez
- , Tetyana I. Vasylyeva
- , Flavia Flaviani
- , Manon Ragonnet-Cronin
- , Katherine L. Smollett
- , Alice Broos
- , Daniel Mair
- , Jenna Nichols
- , Kyriaki Nomikou
- , Lily Tong
- , Ioulia Tsatsani
- , Sarah O’Brien
- , Steven Rushton
- , Roy Sanderson
- , Jon Perkins
- , Seb Cotton
- , Abbie Gallagher
- , Elias Allara
- , Clare Pearson
- , David Bibby
- , Gavin Dabrera
- , Nicholas Ellaby
- , Eileen Gallagher
- , Jonathan Hubb
- , Angie Lackenby
- , David Lee
- , Nikos Manesis
- , Tamyo Mbisa
- , Steven Platt
- , Katherine A. Twohig
- , Mari Morgan
- , Alp Aydin
- , David J. Baker
- , Ebenezer Foster-Nyarko
- , Sophie J. Prosolek
- , Steven Rudder
- , Chris Baxter
- , Sílvia F. Carvalho
- , Deborah Lavin
- , Arun Mariappan
- , Clara Radulescu
- , Aditi Singh
- , Miao Tang
- , Helen Morcrette
- , Nadua Bayzid
- , Marius Cotic
- , Carlos E. Balcazar
- , Michael D. Gallagher
- , Daniel Maloney
- , Thomas D. Stanton
- , Kathleen A. Williamson
- , Robin Manley
- , Michelle L. Michelsen
- , Christine M. Sambles
- , David J. Studholme
- , Joanna Warwick-Dugdale
- , Richard Eccles
- , Matthew Gemmell
- , Richard Gregory
- , Margaret Hughes
- , Charlotte Nelson
- , Lucille Rainbow
- , Edith E. Vamos
- , Hermione J. Webster
- , Mark Whitehead
- , Claudia Wierzbicki
- , Adrienn Angyal
- , Luke R. Green
- , Max Whiteley
- , Emma Betteridge
- , Iraad F. Bronner
- , Ben W. Farr
- , Scott Goodwin
- , Stefanie V. Lensing
- , Shane A. McCarthy
- , Michael A. Quail
- , Diana Rajan
- , Nicholas M. Redshaw
- , Carol Scott
- , Lesley Shirley
- & Scott A. J. Thurston
Software and analysis tools
- Will Rowe
- , Amy Gaskin
- , Thanh Le-Viet
- , James Bonfield
- , Jennifier Liddle
- & Andrew Whitwham

Contributions

H.S.V. and M.G. developed the analysis code, which H.S.V. implemented with input from A.W.J.; H.S.V. created most of the figures. M.S. analysed, annotated and aggregated viral genome data. N.D.M. conducted phylogeographic analyses supervised by N.G.; T.S., R.G., M.S. and H.S.V. developed the interactive spatiotemporal viewer. T.N., F.S., I.H., R.A., C.A., S.G., D.J., I.J., C.S., J.S., T.S. and M.S. analysed genomic surveillance data under the supervision of D.K., M.C., I.M. and J.C.B.; J.H. and S.F. analysed ONS data and helped with epidemiological modelling and data interpretation. E.V. analysed growth rates and helped with data interpretation. E.B. and J.P.G. supervised H.S.V. and helped with data interpretation. J.C.B. and M.G. supervised the analysis with advice from I.M.; M.G., H.S.V., M.S., N.D.M., T.S., I.M. and J.C.B. wrote the manuscript with input from all of the co-authors.

Corresponding authors

Correspondence to Jeffrey C. Barrett or Moritz Gerstung.

Ethics declarations

Competing interests

E.B. is a paid consultant of Oxford Nanopore.

Additional information

Peer review information Nature thanks Tulio De Oliveira, Philippe Lemey and Matthew Scotch for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 SARS-CoV-2 surveillance sequencing in England between September 2020 and June 2021.

a. Local monthly coverage across 315 LTLAs. b. Weekly coverage of genomic surveillance sequencing. c. Hospitalization, case and infection fatality rates relative to ONS prevalence. Dots denote mean estimates and error bars 95% CIs.

Extended Data Fig. 2 Genomic surveillance model of total incidence and lineage-specific frequencies.

a. Cubic basis splines (top row) are convolved with the infection to test distribution (row 2 and 3) and used to fit the log incidence in a LTLA and its corresponding derivatives (growth rates; bottom row). b. Example incidence (top row), logarithmic incidence with individual convolved basis functions (dashed lines, row 2), growth rate with individual spline basis derivatives (dashed lines, row 3) and resulting (case) reproduction numbers (growth rate per 5.1d) from our approach (GenomicSurveillance) and estimates by EpiEstim⁴⁸, shifted by 10d to approximate a case reproduction number. c. The relative frequencies of 62 different lineages are modelled using piecewise multinomial logistic regression. The linear logits are modelled to jump stochastically within 21d prior to first observation to account for the effects of new introductions. Shown are the logits of 5 selected lineages in two different LTLAs.

Extended Data Fig. 3 Genomic surveillance model selection.

a. Model loss in terms of the ELBO objective function and the model hyperparameters alpha0 and alpha1 (see Methods). b. Model deviance (calculated as −2 x log pointwise predictive density) with respect to the model hyperparameters α₀ and α₁ (see Methods). c. Mean squared error (MSE) of modelled weekly proportions of highly prevalent lineages with respect to the model parameters α₀ and α₁ (see Methods). d. Same as in c, but for lineages exhibiting low frequencies (VOCs).

Extended Data Fig. 4 Spatiotemporal model of 71 SARS-CoV-2 lineages in 315 English LTLAs between September 2020 and June 2021.

a. Regional lineage specific relative frequency of lineages contributing more than 50 genomes during the time period shown. Dots denote observed data, lines the fits aggregated to each region. b. Same as a, but on a log scale. c. Same data as in a, shown as stacked bar charts. Colours resemble major lineages as indicated and shadings thereof indicate sublineages. d. Same fits as in a, shown as stacked segments. e. Average growth rates for 71 SARS-Cov2 lineages estimated in different regions in England. Dots denote median estimates and error bars 95% CIs.

Extended Data Fig. 5 Relative growth of B.1.177.

a. Lineage-specific relative frequency data in England, excluding B.1.1.7 and other VOCs/VUIs (Category Other includes: A, A.18, A.20, A.23, A.25, A.27, A.28, B, B.29, B.40, None). Colours resemble major lineages as indicated and shadings thereof indicate sublineages. b. Lineage-specific relative frequency data in Denmark, excluding B.1.1.7 and other VOCs/VUIs. Colours resemble major lineages as indicated and shadings thereof indicate sublineages.

Extended Data Fig. 6 Genomic diversity of the SARS-CoV-2 epidemic.

Shown is the entropy (blue), total number of observed Pango lineages (grey, divided by 4), as well as the proportion of B.1.1.7 (orange, right axis). The sweep of B.1.1.7 causes an intermittent decline of genomic diversity as measured by the entropy.

Extended Data Fig. 7 Global phylogenetic trees of selected VOCs/VUIs.

English surveillance and other (targeted and quarantine) samples are highlighted respectively orange and red.

Extended Data Fig. 8 Global phylogenetic trees of B.1.617 sublineages.

a, b and c. English surveillance and other (targeted and quarantine) samples are highlighted respectively orange and red. The trees of B.1.617.1 and B.1.617.2 are rooted. d. Number of UK introductions inferred by parsimony (minimum and maximum numbers) and by Thorney BEAST (95% posterior CI) for each VOC.

Supplementary information

Supplementary Notes

Supplementary Notes 1–3.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–4.

Source data

Source Data Fig. 1

Source Data Fig. 2

Source Data Fig. 3

Source Data Fig. 4

Source Data Fig. 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vöhringer, H.S., Sanderson, T., Sinnott, M. et al. Genomic reconstruction of the SARS-CoV-2 epidemic in England. Nature 600, 506–511 (2021). https://doi.org/10.1038/s41586-021-04069-y

Download citation

Received: 22 May 2021
Accepted: 29 September 2021
Published: 14 October 2021
Issue Date: 16 December 2021
DOI: https://doi.org/10.1038/s41586-021-04069-y

This article is cited by

Real-time identification of epistatic interactions in SARS-CoV-2 from large genome collections
- Gabriel Innocenti
- Maureen Obara
- Marco Galardini
Genome Biology (2024)
Importations of SARS-CoV-2 lineages decline after nonpharmaceutical interventions in phylogeographic analyses
- Sama Goliaei
- Mohammad-Hadi Foroughmand-Araabi
- Alice C. McHardy
Nature Communications (2024)
Maximum likelihood pandemic-scale phylogenetics
- Nicola De Maio
- Prabhav Kalaghatgi
- Nick Goldman
Nature Genetics (2023)
Fitness, growth and transmissibility of SARS-CoV-2 genetic variants
- Erik Volz
Nature Reviews Genetics (2023)
Comparing SARS-CoV-2 variants among children and adolescents in Germany: relative risk of COVID-19-related hospitalization, ICU admission and mortality
- Marietta Jank
- Anna-Lisa Oechsle
- Horst Schroten
Infection (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Main

Epidemiology of SARS-CoV-2 in England

Modelling the dynamics of SARS-CoV-2

Multiple subepidemics in autumn 2020

The spread of Alpha during restrictions

Elimination of lineages in early 2021

Refractory variants with E484K mutations

The rise of Delta from April to June 2021

Discussion

Methods

Pillar 2 SARS-CoV-2 testing data

SARS-CoV-2 surveillance sequencing

Spatiotemporal genomic surveillance model

Motivation

Incidence

Growth rates and R t values

Genomic prevalence

Lineage-specific incidence and growth rates

Inference

Phylogeographic analyses

ONS infection survey analysis

Maps

Limitations

Ethical approval

Reporting summary

Data availability

Code availability

Change history

07 June 2022

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

The Wellcome Sanger Institute COVID-19 Surveillance Team

The COVID-19 Genomics UK (COG-UK) Consortium*

Funding acquisition, leadership and supervision, metadata curation, project administration, samples and logistics, sequencing and analysis, software and analysis tools, and visualization

Funding acquisition, leadership and supervision, metadata curation, project administration, samples and logistics, sequencing and analysis, and software and analysis tools

Leadership and supervision, metadata curation, project administration, samples and logistics, sequencing and analysis, software and analysis tools, and visualization

Funding acquisition, leadership and supervision, metadata curation, samples and logistics, sequencing and analysis, and visualization

Funding acquisition, leadership and supervision, project administration, samples and logistics, sequencing and analysis, and software and analysis tools

Funding acquisition, leadership and supervision, project administration, sequencing and analysis, software and analysis tools, and visualization

Funding acquisition, metadata curation, project administration, samples and logistics, sequencing and analysis, and software and analysis tools

Leadership and supervision, metadata curation, project administration, samples and logistics, software and analysis tools, and visualization

Funding acquisition, leadership and supervision, metadata curation, project administration, and samples and logistics

Funding acquisition, leadership and supervision, metadata curation, samples and logistics, and sequencing and analysis

Funding acquisition, leadership and supervision, metadata curation, sequencing and analysis, and software and analysis tools

Funding acquisition, leadership and supervision, project administration, samples and logistics, and sequencing and analysis

Leadership and supervision, metadata curation, project administration, samples and logistics, and sequencing and analysis

Leadership and supervision, metadata curation, project administration, samples and logistics, and visualization

Leadership and supervision, metadata curation, project administration, sequencing and analysis, and software and analysis tools

Leadership and supervision, metadata curation, samples and logistics, sequencing and analysis, and visualization

Leadership and supervision, metadata curation, sequencing and analysis, software and analysis tools, and visualization

Metadata curation, project administration, samples and logistics, sequencing and analysis, and software and analysis tools

Funding acquisition, samples and logistics, sequencing and analysis, and visualization

Funding acquisition, leadership and supervision, metadata curation and project administration

Funding acquisition, leadership and supervision, metadata curation, and samples and logistics

Funding acquisition, leadership and supervision, metadata curation, and sequencing and analysis

Funding acquisition, leadership and supervision, project administration, and samples and logistics

Funding acquisition, leadership and supervision, sequencing and analysis, and visualization

Leadership and supervision, metadata curation, project administration, and sequencing and analysis

Leadership and supervision, metadata curation, project administration, and samples and logistics

Leadership and supervision, metadata curation, samples and logistics, and sequencing and analysis

Leadership and supervision, metadata curation, samples and logistics, and visualization

Leadership and supervision, metadata curation, sequencing and analysis, and software and analysis tools

Leadership and supervision, project administration, samples and logistics, and sequencing and analysis

Leadership and supervision, sequencing and analysis, software and analysis tools, and visualization

Metadata curation, project administration, samples and logistics, and sequencing and analysis

Metadata curation, project administration, sequencing and analysis, and software and analysis tools

Metadata curation, samples and logistics, sequencing and analysis, and software and analysis tools

Metadata curation, samples and logistics, sequencing and analysis, and visualization

Metadata curation, sequencing and analysis, software and analysis tools, and visualization

Project administration, sequencing and analysis, software and analysis tools, and visualization

Funding acquisition, leadership and supervision, and metadata curation

Funding acquisition, leadership and supervision, and project administration

Funding acquisition, leadership and supervision, and samples and logistics

Growth rates and R _t values