Despite the great performance of deep learning models in many areas, they still make mistakes and underperform on certain subsets of data, i.e. error slices. Given a trained model, it is important to identify its semantically coherent error slices that are easy to interpret, which is referred to as the error slice discovery problem. However, there is no proper metric of slice coherence without relying on extra information like predefined slice labels. Current evaluation of slice coherence requires access to predefined slices formulated by metadata like attributes or subclasses. Its validity heavily relies on the quality and abundance of metadata, where some possible patterns could be ignored. Besides, current algorithms cannot directly incorporate the constraint of coherence into their optimization objective due to absence of an explicit coherence metric, which could potentially hinder their effectiveness. In this paper, we propose manifold compactness, a coherence metric without reliance on extra information by incorporating the data geometry property into its design, and experiments on typical datasets empirically validate the rationality of the metric. Then we develop Manifold Compactness based error Slice Discovery (MCSD), a novel algorithm that directly treats risk and coherence as the optimization objective, and is flexible to be applied to models of various tasks. Extensive experiments on the benchmark and case studies on other typical datasets demonstrate the superiority of MCSD.
翻译:尽管深度学习模型在许多领域表现出色,但它们仍会在某些数据子集上出现错误并表现不佳,即误差切片。对于已训练的模型,识别其语义连贯且易于解释的误差切片至关重要,这一问题被称为误差切片发现问题。然而,在不依赖预定义切片标签等额外信息的情况下,目前缺乏合适的切片连贯性度量指标。现有对切片连贯性的评估需要依赖通过属性或子类等元数据定义的预定义切片,其有效性在很大程度上取决于元数据的质量和丰富程度,可能导致某些潜在模式被忽略。此外,由于缺乏明确的连贯性度量指标,现有算法无法直接将连贯性约束纳入优化目标,这可能会影响其有效性。本文提出流形紧致性,这是一种无需额外信息的连贯性度量方法,其设计融入了数据几何特性,在典型数据集上的实验从经验上验证了该指标的合理性。基于此,我们开发了基于流形紧致性的误差切片发现算法,该算法直接将风险与连贯性作为优化目标,并能灵活应用于各类任务的模型。在基准数据集上的大量实验及其他典型数据集的案例研究均证明了该算法的优越性。