In Multiple Instance Learning (MIL), models are trained using bags of instances, where only a single label is provided for each bag. A bag label is often only determined by a handful of key instances within a bag, making it difficult to interpret what information a classifier is using to make decisions. In this work, we establish the key requirements for interpreting MIL models. We then go on to develop several model-agnostic approaches that meet these requirements. Our methods are compared against existing inherently interpretable MIL models on several datasets, and achieve an increase in interpretability accuracy of up to 30%. We also examine the ability of the methods to identify interactions between instances and scale to larger datasets, improving their applicability to real-world problems.
翻译:在多实例学习(MIL)中,模型是用实例袋培训的,每个包袋只提供单一标签。包袋标签往往只由包袋内少数关键实例决定,因此难以解释分类者在决策时使用哪些信息。在这项工作中,我们为解释MIL模型确定了关键要求。然后,我们继续制定几种符合这些要求的模型 -- -- 不可知性方法。我们的方法与几个数据集上现有的可解释的MIL模型进行比较,并达到最高30%的可解释性准确度。我们还审查了查明实例和规模之间相互作用的方法与较大数据集之间互动的能力,提高了它们对现实世界问题的适用性。