Data-driven optimization uses contextual information and machine learning algorithms to find solutions to decision problems with uncertain parameters. While a vast body of work is dedicated to interpreting machine learning models in the classification setting, explaining decision pipelines involving learning algorithms remains unaddressed. This lack of interpretability can block the adoption of data-driven solutions as practitioners may not understand or trust the recommended decisions. We bridge this gap by introducing a counterfactual explanation methodology tailored to explain solutions to data-driven problems. We introduce two classes of explanations and develop methods to find nearest explanations of random forest and nearest-neighbor predictors. We demonstrate our approach by explaining key problems in operations management such as inventory management and routing.
翻译:数据驱动优化利用背景信息和机器学习算法来寻找解决不确定参数决定问题的办法。虽然大量工作致力于在分类设置中解释机器学习模式,但解释涉及学习算法的决策管道的问题仍未得到解决。这种缺乏解释性的做法可能阻碍采用数据驱动解决方案,因为从业人员可能不理解或不相信建议的决定。我们通过采用反事实解释方法来弥补这一差距,我们专门为解释数据驱动问题的解决办法而采用反事实解释方法。我们引入了两类解释,并开发了方法,以找到随机森林和近邻预测器的最接近的解释。我们通过解释诸如库存管理和路由等业务管理方面的关键问题来展示我们的方法。