A central problem in business concerns the optimal allocation of limited resources to a set of available tasks, where the payoff of these tasks is inherently uncertain. In credit card fraud detection, for instance, a bank can only assign a small subset of transactions to their fraud investigations team. Typically, such problems are solved using a classification framework, where the focus is on predicting task outcomes given a set of characteristics. Resources are then allocated to the tasks that are predicted to be the most likely to succeed. However, we argue that using classification to address task uncertainty is inherently suboptimal as it does not take into account the available capacity. Therefore, we first frame the problem as a type of assignment problem. Then, we present a novel solution using learning to rank by directly optimizing the assignment's expected profit given limited, stochastic capacity. This is achieved by optimizing a specific instance of the net discounted cumulative gain, a commonly used class of metrics in learning to rank. Empirically, we demonstrate that our new method achieves higher expected profit and expected precision compared to a classification approach for a wide variety of application areas and data sets. This illustrates the benefit of an integrated approach and of explicitly considering the available resources when learning a predictive model.
翻译:业务的一个中心问题是将有限资源最佳地分配给一组现有任务,而这些工作的回报本来就是不确定的。例如,在发现信用卡欺诈时,银行只能将一小部分交易分配给其欺诈调查组。通常,这些问题利用分类框架来解决,重点是预测任务结果,并有一套特点。然后将资源分配给预计最有可能成功的任务。然而,我们争辩说,使用分类处理任务不确定性本身是不最理想的,因为它没有考虑到现有能力。因此,我们首先将问题描述为一种任务分配问题。然后,我们提出一种新的解决办法,通过直接优化任务预期利润的评级,直接优化任务预期利润,给予有限、随机性能力。这是通过优化净折扣累积收益的具体实例实现的,这是学习排名时常用的计量类别。我们很生动地表明,我们的新方法与各种应用领域和数据集的分类方法相比,预期利润和预期准确性更高。这说明了在明确考虑现有资源时,采用综合方法和明确预测学习模式的好处。