Primary author: David Jorgensen
Olivia Boyd, Lily Geidelberg, David Jorgensen, Manon Ragonnet, Igor Siveroni, Erik Volz and the Imperial College COVID-19 Response Team
Report prepared on 2020-05-28
This report uses full genome sequence data shard publicly by Gujarat Biotechnology Research Centre and a set of international background sequences from GISAID (laboratory acknowledgements)
- A preliminary analysis on Gujarat was previously published on the blog. As more sequence data is now available we are able to focus on Ahmedabad, the largest city and main focal point of SARS-CoV 2 transmission in Gujarat. Future updates will be at this scale. This analysis is still preliminary and based on a small number of sequences from a large, densely populated city.
- We estimate a similar initial R (R0) for Ahmedabad as previously estimated for the state of Gujarat at approximately 2. As before we estimate a modest decline in R over time with available sequence data.
This is analysis is based on :
- 44 whole genomes sampled from within Ahmedabad
- 51 whole genomes sampled from outside of Ahmedabad
- Samples within Ahmedabad were collected between 2020-04-07 and 2020-05-04
As duplicate sequences, those with likely sequencing errors or significant gaps were removed prior to analysis this represents a smaller sample than the total number of Ahmedabad samples uploaded to gisaid. Figure 1 shows the distribution of the included sequences over time, including external sequences.
Reported cases for comparison to our model predictions are taken from covid19india.org, a crowdsourced database for SARS-CoV-2 data from India and so may not match reported cases from government sources exactly. These data are used for comparison purposes only and do not influence the analysis. As district level data from the covid19india API were available only for the past month, case numbers for Ahmedabad were estimated to be 2/3 of the cases from the state as a whole at all time points. This fits with current reporting but is likely to be inaccurate back in time and will be updated when adequate district level data is sourced.
Figure 1: Sampling distributions over time of number of sequences included within the region versus sequences included from the international reservoir.
How many are infected in Ahmedabad?
In this analysis we estimate 12572 [3270-60827] median [95%CI] cumulative infections at the time of the last sample (2020-05-04) by fitting a phylodynamic model to SARS-CoV 2 sequence data. We estimate 3482 reported cases of SARS-CoV 2 in Ahmedabad at the same time point. This is similar to the number of cases estimated for the state of Gujarat in the previous analysis, demonstrating the influence of additional sequence data on this estimate and the importance of Ahmedabad to coronavirus transmission in Gujarat. Epidemiological modelling by other groups suggests a lower reporting rate for coronavirus than estimated in this analysis suggesting it is likely still limited by the depth of available sequence data.
Figure 2: Estimated cumulative infections through time represented by solid black line (median) and 95% CrI (ribbon). Points represent reported cases in Ahmedabad. The dashed line indicates the date of last sample in Ahmedabad in this analysis.
Figure 3: Estimated daily infections through time represented by solid black line (median) and 95% CrI (ribbon). Points represent reported cases in Ahmedabad. The dashed line indicates the date of last sample in Ahmedabad in this analysis.
Figure 4: Estimated percentage of daily cases reported in Ahmedabad. Error bars represent the 95% credible interval.
*Figure 5: Reproduction number through time. The black vertical dashed line indicates the date of last sample in Ahmedabad in this analysis. The red dashed line indicates the date of lockdown in India. *
Reproduction number at last sample (2020-05-04): 1.65 [2.05-0.61] median [95% CrI]
How quickly has the epidemic in Ahmedabad grown?
|Quantile||Reproduction number||Growth rate (per day)||Doubling time (days)|
Table 1: Reproduction number, growth rate and doubling times
How has SARS-CoV 2 evolved in Ahmedabad?
Figure 6: Maximum likelihood phylogeny with the x-axis representing NT substitutions per site. The colour of the tips corresponds to sampling location; red tips were sampled from within the region, grey tips from outside
Figure 7: Time scaled phylogeny co-estimated with epidemiological parameters. The colour of the tips corresponds to sampling location; red tips were sampled from within Ahmedabad.
We present here a time scaled phylogeny of SARS-CoV-2 in Ahmedabad and the included international reservoir. As few nodes have high posterior support we also present a maximum likelhood phylogeny showing genetic distance between the included sequences. Ahmedabad sequences tend to cluster closely together suggesting that the majority of sequenced SARS-CoV-2 results from transmission within the region rather than introductions from elsewhere. This could be the result of significant local transmission or due to a local sampling strategy targeted at interconnected individuals.
Molecular clock rate of evolution: 0.000966 [0.000735-0.00126] median [95% CrI]
Details on methods and priors can be found here.
Table 2: Effective sample size of model parameters
Model version: seijr_0.1.1_coupled
Report version: 20200528-112517-95f05a03
This work was supported by the MRC Centre for Global Infectious Disease Analysis at Imperial College London.