Robust selection of predictors and conditional outlier detection in a perturbed large-dimensional regression context - 专知论文

Predictors/decision functions · Outliers · Identifiable · Robustness · Scenarios

25 April 2021

Robust selection of predictors and conditional outlier detection in a perturbed large-dimensional regression context

Matteo Farnè, Angelos Vouldis

This paper presents a fast methodology, called ROBOUT, to identify outliers in a response variable conditional on a set of linearly related predictors, retrieved from a large granular dataset. ROBOUT is shown to be effective and particularly versatile compared to existing methods in the presence of a number of data idiosyncratic features. ROBOUT is able to identify observations with outlying conditional variance when the dataset contains element-wise sparse variables, and the set of predictors contains multivariate outliers. Existing integrated methodologies like SPARSE-LTS and RLARS are systematically sub-optimal under those conditions. ROBOUT entails a robust selection stage of the statistically relevant predictors (by using a Huber or a quantile loss), the estimation of a robust regression model based on the selected predictors (by LTS, GS or MM), and a criterion to identify conditional outliers based on a robust measure of the residuals' dispersion. We conduct a comprehensive simulation study in which the different variants of the proposed algorithm are tested under an exhaustive set of different perturbation scenarios. The methodology is also applied to a granular supervisory banking dataset collected by the European Central Bank.
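The last stage described in the abstract, flagging conditional outliers from a robust measure of residual dispersion, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which dispersion measure or cutoff ROBOUT uses, so the scaled median absolute deviation (MAD), the cutoff of 3, the function names, and the toy data are all assumptions. The fixed slope of 2 stands in for a coefficient that a robust fit (LTS, GS or MM in the abstract) would supply.

```python
from statistics import median


def mad_scale(values):
    """Scaled median absolute deviation (assumed dispersion measure).

    The factor 1.4826 makes the MAD comparable to a standard deviation
    when the values are normally distributed.
    """
    center = median(values)
    return 1.4826 * median(abs(v - center) for v in values)


def flag_conditional_outliers(residuals, cutoff=3.0):
    """Return one boolean per residual: True if it is flagged as outlying.

    A residual is flagged when its distance from the median residual,
    divided by the scaled MAD, exceeds `cutoff` (an assumed threshold).
    """
    center = median(residuals)
    scale = mad_scale(residuals)
    if scale == 0:
        raise ValueError("MAD is zero; residuals cannot be standardized")
    return [abs(r - center) / scale > cutoff for r in residuals]


# Toy data: y is roughly 2 * x, except for one perturbed response (index 5).
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 30.0, 14.1, 15.9]

# Hypothetical robust fit: the slope 2.0 is assumed, not estimated here.
residuals = [yi - 2.0 * xi for xi, yi in zip(x, y)]

flags = flag_conditional_outliers(residuals)
print(flags)
```

Because both the center and the scale are medians, the single perturbed response barely moves them, which is why it stands out clearly against the other residuals. A standard-deviation-based rule would be inflated by the same observation.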
