Machine learning systems have been extensively used as auxiliary tools in domains that require critical decision-making, such as healthcare and criminal justice. The explainability of decisions is crucial for users to develop trust on these systems. In recent years, the globally-consistent rule-based summary-explanation and its max-support (MS) problem have been proposed, which can provide explanations for particular decisions along with useful statistics of the dataset. However, globally-consistent summary-explanations with limited complexity typically have small supports, if there are any. In this paper, we propose a relaxed version of summary-explanation, i.e., the $q$-consistent summary-explanation, which aims to achieve greater support at the cost of slightly lower consistency. The challenge is that the max-support problem of $q$-consistent summary-explanation (MSqC) is much more complex than the original MS problem, resulting in over-extended solution time using standard branch-and-bound solvers. To improve the solution time efficiency, this paper proposes the weighted column sampling~(WCS) method based on solving smaller problems by sampling variables according to their simplified increase support (SIS) values. Experiments verify that solving MSqC with the proposed SIS-based WCS method is not only more scalable in efficiency, but also yields solutions with greater support and better global extrapolation effectiveness.
翻译:在需要关键决策的领域,例如保健和刑事司法领域,机器学习系统被广泛用作辅助工具,在需要关键决策的领域,例如医疗保健和刑事司法,决策的解释性对于用户建立对这些系统的信任至关重要;近年来,提出了全球一致的基于规则的简要估算及其最大支持问题,这可以为特定决定提供解释,同时提供数据集的有用统计数据;然而,全球一致的、复杂性有限的简要估算通常支持较少,如果有的话,则支持时间过长。为了提高解决方案的时间效率,本文件提议采用加权的列表取样方法,即美元一致的简要估算方法,目的是以略低一致性的代价获得更大的支持; 挑战在于,美元一致的简要估算及其最大支持问题,以及数据集的有用统计; 与最初的MSM问题相比,最大支持问题要复杂得多,因此,如果使用标准的分支和有限解答器,那么过长的解决方案通常支持时间就会过长。 为了提高解决方案的时间效率,本文件提议采用基于更简化的模型取样方法(WCSAS)进行加权的取样方法,而不是以更简化的SISSIS进行更精确的测试,从而更精确地核查。