Explaining a classification result produced by an image- and video-classification model is one of the important but challenging issues in computer vision. Many methods have been proposed for producing heat-map--based explanations for this purpose, including ones based on the white-box approach that uses the internal information of a model (e.g., LRP, Grad-CAM, and Grad-CAM++) and ones based on the black-box approach that does not use any internal information (e.g., LIME, SHAP, and RISE). We propose a new black-box method BOREx (Bayesian Optimization for Refinement of visual model Explanation) to refine a heat map produced by any method. Our observation is that a heat-map--based explanation can be seen as a prior for an explanation method based on Bayesian optimization. Based on this observation, BOREx conducts Gaussian process regression (GPR) to estimate the saliency of each pixel in a given image starting from the one produced by another explanation method. Our experiments statistically demonstrate that the refinement by BOREx improves low-quality heat maps for image- and video-classification results.
翻译:解释通过图像和视频分类模型产生的分类结果是计算机愿景中重要但具有挑战性的问题之一。我们提出了许多方法,用于为此制作基于热映像的解释,包括基于白箱方法的方法,即使用模型的内部信息(例如,LRP、Grad-CAM和Grad-CAM++)和基于黑箱方法,即不使用任何内部信息(例如,LIME、SHAP和RISE)的分类结果。我们提议采用新的黑箱法BOREx(BAYESian优化视觉模型解释),以完善任何方法产生的热映像图。我们的观察是,基于热映像的解释可以被视为基于Bayesian优化的解释方法的事先。基于这一观察,BOREx进行了高山进程回归(GPR),以便从另一种解释方法得出的图像中估算每个像的突出度。我们进行的实验表明,通过BAREx图像改进了低质量图像。