Algorithmic decision making has proliferated and now impacts our daily lives in both mundane and consequential ways. Machine learning practitioners make use of a myriad of algorithms for predictive models in applications as diverse as movie recommendations, medical diagnoses, and parole recommendations without delving into the reasons driving specific predictive decisions. Machine learning algorithms in such applications are often chosen for their superior performance, however popular choices such as random forest and deep neural networks fail to provide an interpretable understanding of the predictive model. In recent years, rule-based algorithms have been used to address this issue. Wang et al. (2017) presented an or-of-and (disjunctive normal form) based classification technique that allows for classification rule mining of a single class in a binary classification; this method is also shown to perform comparably to other modern algorithms. In this work, we extend this idea to provide classification rules for both classes simultaneously. That is, we provide a distinct set of rules for both positive and negative classes. In describing this approach, we also present a novel and complete taxonomy of classifications that clearly capture and quantify the inherent ambiguity in noisy binary classifications in the real world. We show that this approach leads to a more granular formulation of the likelihood model and a simulated-annealing based optimization achieves classification performance competitive with comparable techniques. We apply our method to synthetic as well as real world data sets to compare with other related methods that demonstrate the utility of our proposal.
翻译:机器学习从业者在电影建议、医学诊断和假释建议等多种应用应用中,使用各种算法来预测模型,而不必细细考虑促使作出具体预测决定的原因。 在这类应用中,机器学习算法往往因其优异性而选择,然而,随机森林和深神经网络等大众选择无法提供对预测模式的可解释理解性理解。近年来,基于规则的算法被用来解决这一问题。 Wang et al. (2017年) 提供了一种或多种(常规形式的)基于分类技术,允许对分类规则进行分类,在二元分类中挖掘单一类别;这种方法也显示与其他现代算法相匹配。在这项工作中,我们将这一理念扩大到为这两个类别同时提供分类规则,即随机森林和深层神经网络等大众选择无法提供对预测性模型的可解释性理解性理解性理解。在描述这一方法时,我们还提出了一种新颖和完整的分类方法,即清晰地记录和量化了我们以宁比性比性方法在现实世界中的内在实用性比较方法,我们展示了一种具有可比性的精准性的最佳方法。