COVID mortality in India: National survey data and health facility deaths

India’s national COVID death totals remain undetermined. Using an independent nationally representative survey of 0.14 million (M) adults, we compared COVID mortality during the 2020 and 2021 viral waves to expected all-cause mortality. COVID constituted 29% (95% confidence interval, 28 to 31%) of deaths from June 2020 to July 2021, corresponding to 3.2 M (3.1 to 3.4) deaths, of which 2.7 M (2.6 to 2.9) occurred in April to July 2021 (when COVID doubled all-cause mortality). A subsurvey of 57,000 adults showed similar temporal increases in mortality, with COVID and non-COVID deaths peaking similarly. Two government data sources found that, when compared to prepandemic periods, all-cause mortality was 27% (23 to 32%) higher in 0.2 M health facilities and 26% (21 to 31%) higher in civil registration deaths in 10 states; both increases occurred mostly in 2021. The analyses find that India’s cumulative COVID deaths by September 2021 were six to seven times higher than reported officially.

India's national COVID death totals remain undetermined. Using an independent nationally representative survey of 0.14 million (M) adults, we compared COVID mortality during the 2020 and 2021 viral waves to expected all-cause mortality. COVID constituted 29% (95% confidence interval, 28 to 31%) of deaths from June 2020 to July 2021, corresponding to 3.2 M (3.1 to 3.4) deaths, of which 2.7 M (2.6 to 2.9) occurred in April to July 2021 (when COVID doubled all-cause mortality). A subsurvey of 57,000 adults showed similar temporal increases in mortality, with COVID and non-COVID deaths peaking similarly. Two government data sources found that, when compared to prepandemic periods, allcause mortality was 27% (23 to 32%) higher in 0.2 M health facilities and 26% (21 to 31%) higher in civil registration deaths in 10 states; both increases occurred mostly in 2021. The analyses find that India's cumulative COVID deaths by September 2021 were six to seven times higher than reported officially. A s of 1 January 2022 and prior to the current surge driven by the Omicron variant, India reported over 35 million cases of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), second only to the United States (1). India's official cumulative COVID death count of 0.48 million implies a COVID death rate of~345 per million population, about one-seventh of the US death rate (2). India's reported COVID death totals are widely believed to be underreports because of incomplete certification of COVID deaths and misattribution to chronic diseases and because most deaths occur in rural areas, often without medical attention (3,4). Of India's 10 million deaths estimated by the United Nations Population Division (UNPD) in 2020, over 3 million were not registered and over 8 million did not undergo medical certification (fig. S1 and table S1).
Model-based estimates of cumulative COVID deaths through June 2021 in India range from a few hundred thousand to more than 4 million, with most suggesting a substantial official undercount (5)(6)(7)(8)(9)(10)(11)(12) (table S2). However, models start with official reports and apply varying assumptions, leading to wide or implausible estimates. In the absence of near universal and timely death registration and the lack of release of data from India's Sample Registration System (SRS), which tracks deaths in a random sample of about 1% of Indian homes (13), alternative approaches are needed to estimate COVID deaths. Recorded increases in all-cause mortality during peak pandemic transmission are likely nearly all caused by COVID infection (14). The World Health Organization (WHO) has recognized such counts as a crude but useful method to track the pandemic (15). Reports by journalists and nongovernmental organizations using civil registration system (CRS) data have documented a large increase in deaths from all causes compared with previous years (16). Unfortunately, CRS data are reliably available only in states that cover about half of the estimated total deaths in India and may be affected by changes in the level of registration. Given the marked heterogeneity in the temporal patterns of confirmed COVID mortality cases and deaths across states (17), and the variable background of mortality rates from chronic diseases affected by COVID infection (3), extrapolating from selected states has its limitations.
To fill the gaps in national-level estimates, we quantified COVID mortality in India using one independent and two government data sources. The first study is mortality reported in a nationally representative telephone survey conducted by CVoter, an established, independent, private polling agency, which launched the survey on a nonprofit basis to help track the pandemic [see materials and methods, p. 2 (18)]. The COVID Tracker survey covers 0.14 million adults (including a substudy of 57,000 people in 13,500 households with more exact reporting of COVID and non-COVID deaths in immediate family members) (18,19). In addition, we studied the Government of India's administrative data on national facility-based deaths and CRS deaths in 10 states ( fig. S2).
The CVoter Tracker survey is a nationally representative, random probability-based computer-assisted telephone interview survey carried out daily to track governance, media, and other socioeconomic indicators (19). In March 2020, it began to capture COVID symptoms among adults aged 18 years or older, covering~2100 randomly selected respondents weekly, drawn from~4000 local electoral areas in the whole of the country, providing a rolling 7-day average of COVID symptoms and deaths. The survey covers >98% of Indian population by geography, with interviews in 11 languages. The response rate was 55%; 137,289 respondents in all states and union territories were interviewed from March 2020 to July 2021.
Our numerator was defined as the average weekly percentages of surveyed households reporting a COVID death (defined by the household, as medical certification remains uncommon in India; fig. S1). We excluded the 16% of reported COVID deaths that were below age 35 years (confirmed COVID deaths below this age are infrequent; fig. S3) and subtracted a fixed percentage of 0.59%, which was an assumed value for reported deaths that did not occur among immediate family members. The assumed value drew on observed background rates during February-March 2021, when few COVID cases or deaths were reported in the official government data (see materials and methods, p. 3). Results using survey weights or raw proportions were similar, so we used the latter. We compared these survey-reported COVID deaths to a denominator defined as the expected weekly percentage for allcause deaths, based on 2020 death totals from the UNPD's comprehensive demographic estimates that combine censuses, survey data, and models (20) (Fig. 1). India had about 296 million households in 2020, with an average household size of 4.6 (21). Dividing this into the 10.16 million deaths estimated by the UNPD in India in 2020 yields~3.4% of households expected to report a death from any cause in that year (with nearly identical results for 2021). To this expected allcause proportion, we applied the weekly variation observed in the Million Death Study, a large and representative mortality study conducted within the SRS (3).
For most of the weeks from June 2020 to March 2021, zero to 0.7% of households in the CVoter survey reported a COVID death. Even the upper value of 0.7% during some weeks corresponded to 20% of the expected annual all-cause death proportion of 3.4%. During the first viral peak, 1.2% of households reported a COVID death (or about 35% of expected allcause deaths) over 10 days from 24 September to 4 October 2020. There was a second sharp increase in reported COVID deaths from mid-April to the end of June 2021, reaching weekly peaks close to 6% of households. From 1 April to 1 July 2021, the proportion of households reporting COVID deaths was 3.7%, which was 108% [95% lower limits (LL) and upper limits (UL), 103 to 113%] of the expected all-cause deaths of 3.4% (Table 1). The same comparison for 1 June to 31 December 2020 showed that COVID deaths were 8.1% (7.7 to 8.5%) of expected all-cause deaths.
Applying these proportions to expected overall deaths from 1 June 2020 to 1 July 2021 yielded an estimate of 3.2 million (3.1 to 3.4) COVID deaths, or 29% (28 to 31%) of expected all-cause deaths during the 13-month period, including during the interspersed weeks of assumed lower transmission. The majority of COVID deaths that India experienced throughout the pandemic occurred from 1 April to 1 July 2021 (2.7 million; 2.6 to 2.9). Given that the subtraction value for nonhousehold reporting of COVID deaths was somewhat subjective, we ran sensitivity analyses of 50% and 150% of our baseline of 0.59%, yielding estimates ranging from 2.5 (2.4 to 2.6) to 4.0 (3.8 to 4.1) million COVID deaths.
The COVID Tracker survey's introductory question focused on flu-like symptoms among immediate family members, but the COVID question asked: "Has anyone in your family or surroundings been infected from Corona Virus?" If the self-reported answer was yes, respondents were asked whether the infected individual had died. To address a possible limitation of overreporting (i.e., COVID deaths in "surroundings" but not in the household), from 15 June to 1 Sept 2021 we implemented a substudy among a randomly selected 10% of households from the COVID Tracker Panel.
We ascertained from~57,000 people in 13,500 households who lived in the immediate household as of 1 January 2019, who died and when, and if the respondent thought the death was due to COVID or a non-COVID cause ( Fig. 2 and (20)]. Total COVID deaths reported in 2020 (162) and 2021 (553) corresponded to 1.2% and 4.1% of households re-porting a COVID death, comparable to the proportions in the main COVID Tracker survey. The crude death rate in the substudy more than doubled in 2021 compared to 2019, also consistent with the increase in COVID deaths in the main survey. Compared to 2019, the increase in non-COVID deaths reported during September-October 2020 exceeded reported COVID deaths, but the reverse was true during April-June 2021. This likely reflects the misclassification of non-COVID deaths; COVID infection raises death rates not just from respiratory disease but also from vascular disease, kidney disease, and other causes (22). The Government of India's daily confirmed COVID death totals from 1 June 2020 to 1 July 2021 strongly correlated with the daily death totals in the CVoter main survey (correlation 0.88, p < 0.0001). The government's confirmed COVID death totals for each month from 1 April 2020 to 1 July 2021 correlated with the monthly COVID deaths in the CVoter substudy (correlation 0.84, p < 0.001; fig. S4).
We examined two government-reported data sources as comparisons to the independent CVoter survey. The first data source comprised facility-based all-cause mortality covering a nonrepresentative sample of 0.2 million public hospitals and smaller facilities nationally, more than 90% of them rural (23) (Fig. 3). Compared to 2018-2019, all-cause deaths increased 27% (23 to 32%) during 1 July 2020 to 31 May 2021, equivalent to an excess of 0.63 million deaths (0.53 to 0.73) of 2.32 million expected for the 11 months (Table 1)  reporting a 230% increase and Kerala a 37% increase. In Andhra Pradesh, which had reasonably high coverage of expected rural deaths in facilities, the major increase during April-May 2021 was for deaths of unknown cause, followed by nontuberculosis respiratory conditions, heart disease, and other chronic disease, with a small decrease in death from injuries (table S4). Analysis of increases in overall mortality may therefore better capture the diverse diseases affected by COVID infection.
The second government data source was allcause deaths in the CRS for 10 states with 10 or more months of observations (including the interspersed periods between the two viral waves). In these states, the combined median increase, as a percentage of expected deaths based on UNPD death rates, was 26% (21 to 31%; Table 1). Total excess all-cause deaths were 1.25 million (1.00 to 1.49) for the 10 states that reported about half of national official COVID deaths (2). The median ratio of excess to confirmed COVID deaths ranged from six to seven in the two viral waves for these states (table S5).
Our national estimates of 3.1 to 3.4 million COVID deaths help fill a gap in knowledge from focal or model-based studies (table S2). During the 13 months between 1 June 2020 and 1 July 2021, the proportions of excess deaths from COVID in the national survey (28 to 31%) were comparable to the proportions from all causes in the national facility data (23 to 32%) or the CRS data in 10 states (21 to 31%). However, the major uncertainties in these estimates are not the relatively narrow confidence intervals, but the assumptions about the nonpandemic mortality rates (24). Despite varying methodologies, each with its own limitations, our three studies and those published earlier (table S2) point to a substantial underreporting of deaths in India's official numbers. Most find a much larger excess of deaths in the second viral wave than in the first. Indeed, the COVID pandemic likely doubled the total death rate from all conditions in April-June 2021.
The estimates of 3.1 to 3.4 million deaths from the independent COVID Tracker survey represent a national COVID death rate per million population ranging from about 2300 to 2500, or approximately six-to sevenfold the officially reported rate on 1 September 2021 (1). This would put India's death rate per million population just below the range reported in Brazil (2800 per million) or Colombia (2500 per million), where registration of deaths is far more complete (15). The actual excess deaths in the facilities may be larger as the Government of India has yet to release these data from June 2021 onward. More definitive quantification of excess mortality can be expected once the Registrar General of India relaunches its SRS (13) to cover all deaths occurring in 2020 and 2021. Indeed, the extraordinary COVID death totals that we document warrant adding a simple question on the age, sex, and date of any death (regardless of cause) occurring in 2020 or 2021 to the 2022 national census. Concurrently, India must expand and improve its death registration and medical certification system, with timelier reporting (25). Uncounted or medically uncertified deaths are not uniform, with larger gaps in the poorest states in central India and larger gaps among women than among men ( fig. S1 and table S1).
Both the 2020 and 2021 viral waves were characterized by widespread (and, for 2021, mostly uncontrolled) multigenerational transmission of the virus within households, with high levels of antibodies detected (17). India's notably higher COVID death rate in 2021, compared to the lower than expected death rate in 2020, requires further research. The spread of infection to rural areas in 2021 is one factor, but there might also be differences in the pathogenicity between the original virus (Wuhan) in 2020 and the mix of Alpha and Delta variants accounting for most of the 2021 viral wave (26), or other biological predictors of severe infection that changed between these two waves. Similarly, tracking death rates will be essential to understanding the effects of the Omicron wave currently underway in India, or future viral variants.
The strengths of our study are its national representativeness and distributed sampling 3 of 5  for the survey, use of three data sources, and robust metrics that document increased deaths versus earlier years or expected demographic totals. Our methods are reproducible over time and avoid the limitations of model-based estimates. We focused on increased mortality only in the short time periods of pandemic peaks and assumed no excess mortality between viral peaks. COVID deaths typically are acute, occurring within weeks of infection, but the full effects of COVID infection on various underlying diseases are unknown. Thus, our results are conservative. As we had to rely on household self-reports, we adjusted for possible overreporting of deaths and ensured that denominators of CRS deaths considered the underlying deficiencies in death reporting in India. Nonetheless, we faced several limitations. We compared COVID deaths to expected allcause mortality in the national survey, and in so doing, we might have underestimated the totals that in part arose from increases in deaths misclassified as non-COVID. The metric of excess mortality has limitations as some causes-notably road traffic accidents, non-COVID infections, or other injuries-may have decreased, particularly during COVID lockdowns (table S4). However, the nationally representative Million Death Study, conducted within the SRS, documented that injuries constituted less than 1 in 10 of all deaths in India from 2004 to 2014 (3). By contrast, other causes, including those linked to poor mental health, may have risen, as seen in the US (22). There might also be an increase in some deaths from neglected health services, as reflected in reports that maternal mortality rose during the pandemic months (16), but these may also represent COVID infection among pregnant women (27). Changes in non-COVID causes of death are likely to be small compared with the sharp increases in COVID deaths, particularly during the second viral wave. Household self-reports of deaths likely misclassified various conditions that are in fact COVID-related (28). The COVID Tracker survey data might have overreported COVID deaths, as the questions were not restricted to immediate family members, but the subsurvey, which did not have this limitation, yielded very similar results. Rural facility death reporting may have been biased upward if more people than usual sought care during high transmission months. Delays in death registration or a backlog of deaths corrected suddenly might create a spurious peak of ex-cess deaths. However, in the case of Andhra Pradesh, 98% of deaths registered in May 2021 took place within the previous 30 days, not earlier time periods (16).
In sum, our study finds that Indian COVID deaths are substantially greater than estimated from official reports. If our findings are confirmed, this may require substantial upward revision of WHO's estimates of cumulative global COVID mortality, which as of 1 January 2022, stood at 5.4 million (15). Table 1. Summary estimates of excess deaths in India nationally and for states with 10 or more months of data (including the interim weeks or months that were not pandemic).    *For the independent national survey, these are COVID deaths, and for facility and civil registration deaths, these are all-cause excess deaths. †Out of the annual average of 428,000 CRS deaths for prepandemic years of Rajasthan (table S1), only 218,000 deaths (table S6) or 51% were available for our calculations. Similarly, for Maharashtra, only 66% of CRS deaths were available. Thus, we have adjusted the percentages in the last column to the available data. The lower limits (LL) and upper limits (UL) for facility-based deaths and civil registration deaths and percentages are based on the variation in monthly reporting, and those for the national survey are based on the survey margin error of ±5%. See materials and methods for the calculation formulas used for absolute excess deaths and relative excess mortality for the three data sources.