Rigorous guarantees about the performance of predictive algorithms are necessary in order to ensure their responsible use. Previous work has largely focused on bounding the expected loss of a predictor, but this is not sufficient in many risk-sensitive applications where the distribution of errors is important. In this work, we propose a flexible framework to produce a family of bounds on quantiles of the loss distribution incurred by a predictor. Our method takes advantage of the order statistics of the observed loss values rather than relying on the sample mean alone. We show that a quantile is an informative way of quantifying predictive performance, and that our framework applies to a variety of quantile-based metrics, each targeting important subsets of the data distribution. We analyze the theoretical properties of our proposed method and demonstrate its ability to rigorously control loss quantiles on several real-world datasets.
翻译:为确保以负责任的方式使用预测算法,必须对预测算法的性能提供严格的保证。先前的工作主要侧重于限制预测器的预期损失,但在许多风险敏感应用中,如果错误分布很重要,这是不够的。在这项工作中,我们提出了一个灵活的框架,以产生关于预测器所造成损失分布的四分位数的界限。我们的方法利用观察到的损失值的顺序统计,而不是仅仅依靠抽样平均值。我们表明,量化是量化预测性能的一种信息化方法,我们的框架适用于各种基于量化的计量标准,每个都针对数据分布的重要子集。我们分析了我们拟议方法的理论属性,并展示了它严格控制几个真实世界数据集的损失四分位数的能力。