Expectation Distance-based Distributional Clustering for Noise-Robustness - 专知论文

March 14, 2023

Expectation Distance-based Distributional Clustering for Noise-Robustness

Rahmat Adesunkanmi, Ratnesh Kumar

This paper presents a clustering technique that reduces susceptibility to data noise by learning and clustering the data distributions and then assigning each data item to the cluster of its distribution, which reduces the impact of noise on the clustering results. The method introduces a new distance among distributions, the expectation distance (denoted ED), that goes beyond the state-of-the-art distribution distance of optimal mass transport (denoted $W_2$, for $2$-Wasserstein): the latter essentially depends only on the marginal distributions, while the former also employs information about the joint distributions. Using the ED, the paper extends classical $K$-means and $K$-medoids clustering to operate over data distributions (rather than raw data), and it also introduces $K$-medoids using $W_2$. The paper presents closed-form expressions for the $W_2$ and ED distance measures. Both distance measures are applied to cluster real-world weather data as well as stock data, which involves efficiently extracting and using the underlying data distributions: Gaussians for the weather data and lognormals for the stock data. The results show a striking performance improvement over classical clustering of raw data, with higher accuracy realized for ED. Distribution-based clustering not only offers higher accuracy but also lowers computation time, owing to its reduced time complexity.
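
As a rough illustration of clustering over distributions rather than raw data, the sketch below fits a univariate Gaussian to each noisy series and runs $K$-medoids on pairwise $W_2$ distances. It relies on the well-known closed form for univariate Gaussians, $W_2^2 = (\mu_1 - \mu_2)^2 + (\sigma_1 - \sigma_2)^2$. It is not the authors' implementation: the abstract does not state the ED formula, so ED is not shown, and the function names, the synthetic data, and the farthest-first initialization are all assumptions made for this example.

```python
import numpy as np


def fit_gaussian(samples):
    """Summarize a series by its mean and (population) standard deviation."""
    x = np.asarray(samples, dtype=float)
    return x.mean(), x.std()


def w2_gaussian_1d(p, q):
    """2-Wasserstein distance between two univariate Gaussians (mu, sigma)."""
    (m1, s1), (m2, s2) = p, q
    return np.sqrt((m1 - m2) ** 2 + (s1 - s2) ** 2)


def k_medoids(dist, k, n_iter=100):
    """Plain alternating K-medoids on a precomputed distance matrix.

    Initialization is farthest-first starting from index 0, an assumption
    chosen here only to keep the example deterministic.
    """
    medoids = [0]
    while len(medoids) < k:
        nearest = dist[:, medoids].min(axis=1)
        medoids.append(int(np.argmax(nearest)))
    medoids = np.array(medoids)

    for _ in range(n_iter):
        # Assignment step: each item joins its nearest medoid.
        labels = np.argmin(dist[:, medoids], axis=1)
        new_medoids = medoids.copy()
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size == 0:
                continue  # keep the old medoid for an empty cluster
            # Update step: pick the member with the smallest total distance.
            within = dist[np.ix_(members, members)].sum(axis=1)
            new_medoids[c] = members[np.argmin(within)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids

    labels = np.argmin(dist[:, medoids], axis=1)
    return medoids, labels


# Synthetic data (hypothetical): two groups of noisy series.
rng = np.random.default_rng(42)
series = [rng.normal(0.0, 1.0, 200) for _ in range(5)]
series += [rng.normal(5.0, 2.0, 200) for _ in range(5)]

# Cluster the fitted distributions instead of the raw samples.
params = [fit_gaussian(s) for s in series]
dist = np.array([[w2_gaussian_1d(p, q) for q in params] for p in params])
medoids, labels = k_medoids(dist, k=2)
print(labels)
```

Each series is reduced to two parameters before any distance is computed, so the distance matrix depends on the number of series rather than on the number of raw samples. That is one plausible reading of the reduced time complexity the abstract mentions.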
