Precipitation forecasting is an important scientific challenge that has wide-reaching impacts on society. Historically, this challenge has been tackled using numerical weather prediction (NWP) models, grounded on physics-based simulations. Recently, many works have proposed an alternative approach, using end-to-end deep learning (DL) models to replace physics-based NWP models. While these DL methods show improved performance and computational efficiency, they exhibit limitations in long-term forecasting and lack the explainability. In this work, we present a hybrid NWP-DL workflow to fill the gap between standalone NWP and DL approaches. Under this workflow, the outputs of NWP models are fed into a deep neural network, which post-processes the data to yield a refined precipitation forecast. The deep model is trained with supervision, using Automatic Weather Station (AWS) observations as ground-truth labels. This can achieve the best of both worlds, and can even benefit from future improvements in NWP technology. To facilitate study in this direction, we present a novel dataset focused on the Korean Peninsula, termed KoMet (Korea Meteorological Dataset), comprised of NWP outputs and AWS observations. For the NWP model, the Global Data Assimilation and Prediction Systems-Korea Integrated Model (GDAPS-KIM) is utilized. We provide analysis on a comprehensive set of baseline methods aimed at addressing the challenges of KoMet, including the sparsity of AWS observations and class imbalance. To lower the barrier to entry and encourage further study, we also provide an extensive open-source Python package for data processing and model development. Our benchmark data and code are available at https://github.com/osilab-kaist/KoMet-Benchmark-Dataset.
翻译:降水预报是一个重要的科学挑战,对社会具有广泛影响。历史上,这项挑战一直通过数字天气预测模型(NWP)和DL方法来解决,以物理模拟为基础。最近,许多著作提出了替代方法,用端到端深的学习模型(DL)取代物理的NWP模型。虽然这些DL方法显示业绩和计算效率的提高,但它们在长期预测方面表现出局限性,缺乏解释性。在这项工作中,我们提出了一个NWP-DL混合工作流程,以填补独立的NWP和DL方法之间的差距。在这个工作流程中,NWP模型的输出被输入到一个深层神经网络中,而后处理数据以产生改良的降水预报。 深层模型经过监督,使用自动气象站的观测作为地图标签。这可以达到两个世界的最佳效果,甚至从NWP技术的未来模型改进中受益。为了促进这方面的研究,我们提出了一个新的数据集,称为KoMet观测(Korea MMSI) 的公开观测结果, 包括ASIS 的模型和MISA系统的综合数据分析。