We introduce a new rule-based optimization method for classification with constraints. The proposed method takes advantage of linear programming and column generation, and hence, is scalable to large datasets. Moreover, the method returns a set of rules along with their optimal weights indicating the importance of each rule for learning. Through assigning cost coefficients to the rules and introducing additional constraints, we show that one can also consider interpretability and fairness of the results. We test the performance of the proposed method on a collection of datasets and present two case studies to elaborate its different aspects. Our results show that a good compromise between interpretability and fairness on the one side, and accuracy on the other side, can be obtained by the proposed rule-based learning method.
翻译:我们采用新的基于规则的优化方法进行有限制的分类。拟议方法利用线性编程和列生成,因此可以向大型数据集推广。此外,该方法还返回一套规则及其最佳权重,表明每项规则对学习的重要性。我们通过给规则分配成本系数和引入额外的制约,表明人们也可以考虑结果的可解释性和公平性。我们测试了拟议方法在收集数据集方面的绩效,并提出了两个案例研究,以详细阐述其不同方面。我们的结果表明,拟议的基于规则的学习方法可以取得对一方的解释性和公正性与另一方准确性之间的良好妥协。