Likelihood-based inference has been remarkably successful across a wide range of application areas. However, even after due diligence in selecting a good model for the data at hand, some amount of model misspecification is inevitable: outliers, data contamination, or inappropriate parametric assumptions such as Gaussianity mean that most models are at best rough approximations of reality. A significant practical concern is that for certain inferences, even a small amount of model misspecification can have a substantial impact, a problem we refer to as brittleness. This article addresses the brittleness problem in likelihood-based inference by choosing the most model-friendly data generating process in a discrepancy-based neighborhood of the empirical measure. This leads to a new Optimistically Weighted Likelihood (OWL), which robustifies the original likelihood by formally accounting for a small amount of model misspecification. Focusing on total variation (TV) neighborhoods, we study theoretical properties, develop inference algorithms, and illustrate the methodology in applications to mixture models and regression.
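As a rough illustrative sketch (the function name, interface, and greedy simplification below are ours, not taken from the paper), the "most model-friendly" choice within a TV neighborhood can be seen as an inner maximization over observation weights: for a fixed parameter value, maximize the weighted log-likelihood over all reweightings of the empirical measure that stay within TV distance ε of the uniform weights. This linear program has a simple greedy solution, shifting up to ε of probability mass from the worst-fitting observations to the best-fitting one:

```python
import numpy as np

def owl_tv_weights(loglik, eps):
    """Greedy solution to the inner OWL-style maximization (illustrative sketch).

    loglik : array of per-observation log-likelihoods l_i = log p_theta(x_i)
    eps    : TV radius; maximize sum_i w_i * l_i subject to
             w_i >= 0, sum_i w_i = 1, (1/2) sum_i |w_i - 1/n| <= eps.
    """
    n = len(loglik)
    w = np.full(n, 1.0 / n)          # start from the empirical measure
    budget = eps                      # total probability mass we may move
    order = np.argsort(loglik)        # ascending: worst-fitting points first
    best = order[-1]                  # best-fitting point receives the mass
    for i in order[:-1]:
        take = min(w[i], budget)      # downweight a poorly fitting point
        w[i] -= take
        w[best] += take
        budget -= take
        if budget <= 0:
            break
    return w
```

An outer optimization over the model parameter (e.g. alternating between this reweighting step and a weighted maximum-likelihood update) would then yield an OWL-style estimate; the two-step alternation shown here is our assumption about how such a procedure could be organized, not a description of the paper's algorithm.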