The COVID-19 pandemic has caused major disturbance to human life. An important reason behind the widespread social anxiety is the huge uncertainty about the pandemic. A fundamental uncertainty is how many or what percentage of people have been infected. There are published and frequently updated data on various statistics of the pandemic, at local, country or global level. However, due to various reasons, many cases were not included in those reported numbers. We propose a structured approach for the estimation of the number of unreported cases, where we distinguish cases that arrive late in the reported numbers and those who had mild or no symptoms and thus were not captured by any medical system at all. We use post-report data for the estimation of the former and population matching to the latter. We estimate that the reported number of infected cases in the US should be corrected by multiplying a factor of 220.54% as of Apr 20, 2020, while the infection ratio out of the US population is estimated to be 0.53%, implying a case mortality rate at 2.85% which is close to the 3.4% suggested by the WHO in Mar 2020. Towards the end of the summer of 2020, the overall infection ratio of the US rises to 2.49% while the case mortality decreases to 2.09%, and the ratio of asymptomatic cases out of all infected cases reduces from the pre-summer 35-40% to around 20-25%.
翻译:COVID-19大流行给人类生活造成了重大干扰。广泛社会焦虑的一个重要原因是该流行病的巨大不确定性。一个根本的不确定性是感染者的人数或百分比。当地、国家或全球各级都有关于该流行病的各种统计数据的公布和经常更新的数据。然而,由于各种原因,许多病例没有列入报告的数字。我们建议了一种结构化的方法来估计未报告的病例数量,其中我们区分了在报告数量上较晚的病例以及那些有轻微或无症状、因此没有被任何医疗系统吸收的病例。我们使用报告后的数据来估计前一种病例和与后一种病例相匹配的人口。我们估计,美国报告的感染病例数量应该通过到2020年4月20日的220.54%的乘数加以纠正,而美国人口的感染率估计为0.53 %,这意味着病例死亡率为2.85%,接近世界卫生组织在2020年3月提出的3.4%。2020年夏季末,美国感染病例和与前后一种病例的总感染率从2.59%下降到2.59%,而感染病例的总感染率从209%下降到25.0%左右,而感染病例总数从209%下降到25 %。