Many research papers that propose models to predict the course of the COVID-19 pandemic either use handcrafted statistical models or large neural networks. Even though large neural networks are more powerful than simpler statistical models, they are especially hard to train on small datasets. This paper not only presents a model with grater flexibility than the other proposed neural networks, but also presents a model that is effective on smaller datasets. To improve performance on small data, six regularisation methods were tested. The results show that the GRU combined with 20% Dropout achieved the lowest RMSE scores. The main finding was that models with less access to data relied more on the regulariser. Applying Dropout to a GRU model trained on only 28 days of data reduced the RMSE by 23%.
翻译:许多研究论文提出了预测COVID-19大流行过程的模型,它们要么使用手工制作的统计模型,要么使用大型神经网络。尽管大型神经网络比简单的统计模型更强大,但它们在小型数据集方面特别难以培训。本文不仅提出了一个比其他拟议的神经网络灵活得多的模型,而且还提出了一个对小型数据集有效的模型。为了改进小型数据的性能,已经测试了六种规范化方法。结果显示,GRU加上20%的辍学者获得了最低的RMSE分数。主要发现,获得数据较少的模型更依赖于常规化的模型。在仅经过28天数据培训的GRU模型中,将RMSE减少了23%。