Data-IQ:在表格数据中说明具有不同结果的分组的特征 (Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data)

High model performance, on average, can hide that models may systematically underperform on subgroups of the data. We consider the tabular setting, which surfaces the unique issue of outcome heterogeneity - this is prevalent in areas such as healthcare, where patients with similar features can have different outcomes, thus making reliable predictions challenging. To tackle this, we propose Data-IQ, a framework to systematically stratify examples into subgroups with respect to their outcomes. We do this by analyzing the behavior of individual examples during training, based on their predictive confidence and, importantly, the aleatoric (data) uncertainty. Capturing the aleatoric uncertainty permits a principled characterization and then subsequent stratification of data examples into three distinct subgroups (Easy, Ambiguous, Hard). We experimentally demonstrate the benefits of Data-IQ on four real-world medical datasets. We show that Data-IQ's characterization of examples is most robust to variation across similarly performant (yet different) models, compared to baselines. Since Data-IQ can be used with any ML model (including neural networks, gradient boosting etc.), this property ensures consistency of data characterization, while allowing flexible model selection. Taking this a step further, we demonstrate that the subgroups enable us to construct new approaches to both feature acquisition and dataset selection. Furthermore, we highlight how the subgroups can inform reliable model usage, noting the significant impact of the Ambiguous subgroup on model generalization.

翻译：平均而言,高模型性能可以掩盖模型在数据分组上可能系统性地表现不佳的情况。我们考虑表格设置,该设置展示了结果差异性的独特问题,在保健等领域很普遍,具有类似特征的病人可以产生不同的结果,从而作出可靠的预测。为了解决这个问题,我们提议Data-IQ,这是一个将实例系统地分解为分组分析其结果的框架。我们这样做的方法是分析培训期间个别实例的行为,基于其预测性信心,以及重要的是,阅读(数据)不确定性。如果掌握了清晰的不确定性,就可以有原则地定性,然后将数据实例分解成三个不同的分组(如保健,具有相似特征的病人可以产生不同的结果,因此我们试验性地展示了数据-IQ在四个真实世界医学数据集上的好处。我们显示,Data-IQ对实例的描述最有力,可以使类似性(目前不同)模型的变异)模型与基线相比较。由于数据-IQ可以用于任何模型模型模型(包括神经性网络、梯度推动模型推动新的模型),我们又可以进一步确保数据结构的变异化。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日