Empirical risk minimization (ERM) is typically designed to perform well on the average loss, which can result in estimators that are sensitive to outliers, generalize poorly, or treat subgroups unfairly. While many methods aim to address these problems individually, in this work, we explore them through a unified framework -- tilted empirical risk minimization (TERM). In particular, we show that it is possible to flexibly tune the impact of individual losses through a straightforward extension to ERM using a hyperparameter called the tilt. We provide several interpretations of the resulting framework: We show that TERM can increase or decrease the influence of outliers, respectively, to enable fairness or robustness; has variance-reduction properties that can benefit generalization; and can be viewed as a smooth approximation to a superquantile method. We develop batch and stochastic first-order optimization methods for solving TERM, and show that the problem can be efficiently solved relative to common alternatives. Finally, we demonstrate that TERM can be used for a multitude of applications, such as enforcing fairness between subgroups, mitigating the effect of outliers, and handling class imbalance. TERM is not only competitive with existing solutions tailored to these individual problems, but can also enable entirely new applications, such as simultaneously addressing outliers and promoting fairness.
翻译:风险最小化(ERM)通常是为了在平均损失上取得良好的效果,这可能导致对外部值敏感、普遍化或不公平地对待子群的估测因素,尽管许多方法旨在单独解决这些问题,但在这项工作中,我们通过一个统一框架 -- -- 倾斜的经验风险最小化(TERM)来探索这些问题。我们特别表明,通过使用称为倾斜的超参数直接扩展到机构风险管理,可以灵活地调整个人损失的影响。我们提供了对由此产生的框架的若干解释:我们表明,TER可以增加或降低外部值的影响,从而分别实现公平或稳健;具有减少差异的特性,从而有利于普遍化;并且可以被视为一种超量化方法的平稳近似。我们开发了解决长期值的批量和随机一级优化方法,并表明问题可以与普通的替代方法相对有效地解决。最后,我们证明TERM可用于多种应用,例如加强分群之间的公平性,减轻外部值的影响,以及处理类别失衡问题。TERM是不仅能够同时使现有各种解决办法具有竞争性,而且能够使新的解决办法能够同时促进这些不同的应用。