Assessing the validity of a real-world system with respect to given quality criteria is a common yet costly task in industrial applications due to the vast number of required real-world tests. Validating such systems by means of simulation offers a promising and less expensive alternative, but requires an assessment of the simulation accuracy and therefore end-to-end measurements. Additionally, covariate shifts between simulations and actual usage can cause difficulties for estimating the reliability of such systems. In this work, we present a validation method that propagates bounds on distributional discrepancy measures through a composite system, thereby allowing us to derive an upper bound on the failure probability of the real system from potentially inaccurate simulations. Each propagation step entails an optimization problem, where -- for measures such as maximum mean discrepancy (MMD) -- we develop tight convex relaxations based on semidefinite programs. We demonstrate that our propagation method yields valid and useful bounds for composite systems exhibiting a variety of realistic effects. In particular, we show that the proposed method can successfully account for data shifts within the experimental design as well as model inaccuracies within the used simulation.
翻译:评估真实世界系统在特定质量标准方面的有效性是工业应用中一项常见但成本高昂的任务,因为需要进行大量真实世界测试。通过模拟来验证这些系统提供了一种有希望和不太昂贵的替代方法,但需要评估模拟准确性,从而需要评估端到端到端测量。此外,模拟和实际使用之间的千变万化会给估计这种系统的可靠性造成困难。在这项工作中,我们提出了一个验证方法,通过综合系统传播分布差异计量的界限,从而使我们能够从可能不准确的模拟中得出对真实系统失灵概率的上限。每个传播步骤都会产生优化问题,因为对于诸如最大平均差异(MMD)等措施,我们根据半成型程序发展紧紧的矩形松动。我们证明,我们的传播方法为综合系统提供了有效而有用的界限,显示出各种现实的影响。我们特别表明,拟议的方法能够成功地计算实验设计中的数据变化以及所用模拟中的模型的不准确性。