Markov 基基于 Markov 的子取样标准 (Markov subsampling based Huber Criterion) - 专知论文

会员服务 ·

0

子采样 · Performer · 准则 · Extensibility · 样本 ·

2021 年 12 月 12 日

Markov subsampling based Huber Criterion

翻译：Markov 基基于 Markov 的子取样标准

Tieliang Gong,Yuxin Dong,Hong Chen,Bo Dong,Chen Li

Subsampling is an important technique to tackle the computational challenges brought by big data. Many subsampling procedures fall within the framework of importance sampling, which assigns high sampling probabilities to the samples appearing to have big impacts. When the noise level is high, those sampling procedures tend to pick many outliers and thus often do not perform satisfactorily in practice. To tackle this issue, we design a new Markov subsampling strategy based on Huber criterion (HMS) to construct an informative subset from the noisy full data; the constructed subset then serves as a refined working data for efficient processing. HMS is built upon a Metropolis-Hasting procedure, where the inclusion probability of each sampling unit is determined using the Huber criterion to prevent over scoring the outliers. Under mild conditions, we show that the estimator based on the subsamples selected by HMS is statistically consistent with a sub-Gaussian deviation bound. The promising performance of HMS is demonstrated by extensive studies on large scale simulations and real data examples.

翻译：子取样是应对海量数据带来的计算挑战的重要方法。许多子取样程序属于重要取样框架的范围,它给样本的抽样概率定得很高,似乎具有很大的影响。当噪音水平高时,这些取样程序往往会挑选许多外出点,因此在实践中往往不能令人满意地发挥作用。为了解决这一问题,我们根据Huber标准设计了一个新的Markov子取样战略,以从噪音的完整数据中构建一个信息子集;然后,建造的子集作为高效处理的精细工作数据。HMS建立在大都会-Hasting程序的基础上,在这个程序的基础上,每个采样单位的列入概率是使用Huber标准来决定的,以防止超过外出点。在温和的条件下,我们表明基于HMS所选的子样本的测算器在统计上与亚高加索偏离界限相一致。大型模拟和真实数据实例的广泛研究表明HMS有良好的表现。

0

相关内容

子采样

「小样本深度学习图像识别」最新2022综述

「小样本深度学习图像识别」最新2022综述

专知会员服务

102+阅读 · 2022年1月15日

图像去噪方法概述

专知会员服务

43+阅读 · 2021年8月30日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

极市平台

16+阅读 · 2018年1月20日

推荐｜Andrew Ng计算机视觉教程总结

推荐｜Andrew Ng计算机视觉教程总结

全球人工智能

3+阅读 · 2017年11月23日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

极市平台

5+阅读 · 2017年6月15日

Sparse Markov Models for High-dimensional Inference

Arxiv

0+阅读 · 2022年2月16日

Predictability and Surprise in Large Generative Models

Arxiv

0+阅读 · 2022年2月15日

Computer Vision and Normalizing Flow-Based Defect Detection

Arxiv

0+阅读 · 2022年2月14日

Deep Probability Estimation

Arxiv

0+阅读 · 2022年2月14日

A new measure for assessment of clustering based on kernel density estimation

Arxiv

0+阅读 · 2022年2月13日

A Unified Prediction Framework for Signal Maps

Arxiv

0+阅读 · 2022年2月12日

Conformal prediction for the design problem

Arxiv

0+阅读 · 2022年2月11日

Parameter uncertainty estimation for exponential semi-variogram models: Two generalized bootstrap methods with check- and quantile-based filtering

Parameter uncertainty estimation for exponential semi-variogram models: Two generalized bootstrap methods with check- and quantile-based filtering

Arxiv

0+阅读 · 2022年2月11日

Inference and FDR Control for Simulated Ising Models in High-dimension

Arxiv

0+阅读 · 2022年2月11日

Fitting Sparse Markov Models to Categorical Time Series Using Regularization

Arxiv

0+阅读 · 2022年2月11日

VIP会员

文章信息

相关主题

相关VIP内容

「小样本深度学习图像识别」最新2022综述

「小样本深度学习图像识别」最新2022综述

专知会员服务

102+阅读 · 2022年1月15日

图像去噪方法概述

专知会员服务

43+阅读 · 2021年8月30日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《动态作战支援演习框架构建》80页

《大规模作战行动中自动化战场创伤系统的概念验证》

《自适应训练辅助系统概念导论及其在空战指挥官加速培训中的应用》125页

《美陆军近战整合企业现代化计划（2025—2026）》最新报告

相关资讯

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

IBM新论文|SamplePairing：针对图像处理领域的高效数据增强方式

极市平台

16+阅读 · 2018年1月20日

推荐｜Andrew Ng计算机视觉教程总结

推荐｜Andrew Ng计算机视觉教程总结

全球人工智能

3+阅读 · 2017年11月23日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

极市平台

5+阅读 · 2017年6月15日

相关论文

Sparse Markov Models for High-dimensional Inference

Arxiv

0+阅读 · 2022年2月16日

Predictability and Surprise in Large Generative Models

Arxiv

0+阅读 · 2022年2月15日

Computer Vision and Normalizing Flow-Based Defect Detection

Arxiv

0+阅读 · 2022年2月14日

Deep Probability Estimation

Arxiv

0+阅读 · 2022年2月14日

A new measure for assessment of clustering based on kernel density estimation

Arxiv

0+阅读 · 2022年2月13日

A Unified Prediction Framework for Signal Maps

Arxiv

0+阅读 · 2022年2月12日

Conformal prediction for the design problem

Arxiv

0+阅读 · 2022年2月11日

Parameter uncertainty estimation for exponential semi-variogram models: Two generalized bootstrap methods with check- and quantile-based filtering

Parameter uncertainty estimation for exponential semi-variogram models: Two generalized bootstrap methods with check- and quantile-based filtering

Arxiv

0+阅读 · 2022年2月11日

Inference and FDR Control for Simulated Ising Models in High-dimension

Arxiv

0+阅读 · 2022年2月11日

Fitting Sparse Markov Models to Categorical Time Series Using Regularization

Arxiv

0+阅读 · 2022年2月11日

微信扫码咨询专知VIP会员