In recent years fully-parametric fast simulation methods based on generative models have been proposed for a variety of high-energy physics detectors. By their nature, the quality of data-driven models degrades in the regions of the phase space where the data are sparse. Since machine-learning models are hard to analyse from the physical principles, the commonly used testing procedures are performed in a data-driven way and can't be reliably used in such regions. In our work we propose three methods to estimate the uncertainty of generative models inside and outside of the training phase space region, along with data-driven calibration techniques. A test of the proposed methods on the LHCb RICH fast simulation is also presented.
翻译:近年来,为各种高能物理探测器提出了基于基因模型的完全参数快速模拟方法,从性质上看,数据驱动模型的质量在数据稀少的阶段空间区域会下降,由于机器学习模型很难从物理原理中分析,通常使用的测试程序是以数据驱动方式进行的,无法在这类区域可靠地使用。在我们的工作中,我们提出了三种方法,用以估计培训阶段空间区域内外基因模型的不确定性,以及数据驱动校准技术。还介绍了关于LHCb RICH快速模拟的拟议方法的测试。