Hybrid modelling reduces the misspecification of expert models by combining them with machine learning (ML) components learned from data. Like for many ML algorithms, hybrid model performance guarantees are limited to the training distribution. Leveraging the insight that the expert model is usually valid even outside the training domain, we overcome this limitation by introducing a hybrid data augmentation strategy termed \textit{expert augmentation}. Based on a probabilistic formalization of hybrid modelling, we show why expert augmentation improves generalization. Finally, we validate the practical benefits of augmented hybrid models on a set of controlled experiments, modelling dynamical systems described by ordinary and partial differential equations.
翻译:混合模型将专家模型与从数据中吸取的机器学习(ML)组件结合起来,从而减少了专家模型的错误区分。与许多ML算法一样,混合模型性能保障仅限于培训分布。利用专家模型通常即使在培训领域以外也有效的洞察力,我们通过采用称为\ textit{专家增强}的混合数据增强战略克服了这一限制。根据混合模型的概率性正规化,我们证明专家增强之所以能改进一般化。最后,我们验证了在一套受控实验上强化混合模型的实际好处,以及以普通和部分差异方程式描述的模拟动态系统。