We propose a new active learning (AL) framework, Active Learning++, which can utilize an annotator's labels as well as its rationale. Annotators can provide their rationale for choosing a label by ranking input features based on their importance for a given query. To incorporate this additional input, we modified the disagreement measure for a bagging-based Query by Committee (QBC) sampling strategy. Instead of weighing all committee models equally to select the next instance, we assign higher weight to the committee model with higher agreement with the annotator's ranking. Specifically, we generated a feature importance-based local explanation for each committee model. The similarity score between feature rankings provided by the annotator and the local model explanation is used to assign a weight to each corresponding committee model. This approach is applicable to any kind of ML model using model-agnostic techniques to generate local explanation such as LIME. With a simulation study, we show that our framework significantly outperforms a QBC based vanilla AL framework.
翻译:我们建议一个新的积极学习框架(AL)框架(积极学习++),它可以使用说明者的标签及其理由。说明者可以提供其根据对特定查询的重要性通过排名输入特征选择标签的理由。为了纳入这一额外投入,我们修改了委员会(QBC)基于包装的查询方法的分歧措施。我们没有对所有委员会模型进行同等权衡以选择下一个实例,而是给委员会模型赋予与说明者等级比较一致的更高份量。具体地说,我们为每个委员会模型提出了一个基于特点的重要性的地方解释。说明者提供的特征排名和当地模型解释之间的相似性评分被用来给每个相应的委员会模式分配权重。这一方法适用于使用模型学技术产生当地解释(如LIME)的任何类型的ML模型。我们通过模拟研究,表明我们的框架大大超越了基于QBC的 Vanilla AL框架。