Minimization of the (regularized) entropy of classification probabilities is a versatile class of discriminative clustering methods. The classification probabilities are usually defined through the use of some classical losses from supervised classification, and the point is to avoid modeling the full data distribution by optimizing only the law of the labels conditioned on the observations. We give the first theoretical study of such methods, by specializing to logistic classification probabilities. We prove that if the observations are generated from a two-component isotropic Gaussian mixture, then minimizing the entropy risk over a Euclidean ball indeed allows one to identify the separation vector of the mixture. Furthermore, if this separation vector is sparse, then penalizing the empirical risk by an $\ell_{1}$-regularization term allows one to infer the separation in a high-dimensional space and to recover its support, at the standard rates of sparsity problems. Our approach is based on the local convexity of the logistic entropy risk, which holds whenever the separation vector is large enough, under a condition on its norm that is independent of the space dimension. This local convexity property also guarantees fast rates in a classical, low-dimensional setting.
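As a minimal illustrative sketch (not the authors' implementation), the objective described above can be written down directly: for a logistic classification probability $p_w(x) = \sigma(\langle w, x\rangle)$, one minimizes the empirical entropy $\frac{1}{n}\sum_i H(p_w(x_i))$ over a Euclidean ball via projected gradient descent. The data generator, variable names (`theta`, `entropy_risk`), step size, and ball radius below are all illustrative assumptions.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def entropy_risk(w, X):
    # Empirical entropy of the logistic classification probabilities:
    # (1/n) * sum_i H(sigmoid(<w, x_i>)), H(p) = -p log p - (1-p) log(1-p).
    p = np.clip(sigmoid(X @ w), 1e-12, 1 - 1e-12)
    return np.mean(-p * np.log(p) - (1 - p) * np.log(1 - p))

def entropy_risk_grad(w, X):
    # d/dt H(sigmoid(t)) = -t * p * (1 - p), with p = sigmoid(t).
    t = X @ w
    p = sigmoid(t)
    return X.T @ (-t * p * (1 - p)) / len(X)

def project_ball(w, r):
    # Euclidean projection onto the ball of radius r.
    n = np.linalg.norm(w)
    return w if n <= r else w * (r / n)

# Two-component isotropic Gaussian mixture with separation vector theta:
# X = eps * theta + N(0, I), eps uniform on {-1, +1} (illustrative setup).
rng = np.random.default_rng(0)
d, n = 5, 2000
theta = np.zeros(d)
theta[0] = 3.0  # large enough separation, as required for local convexity
labels = rng.integers(0, 2, n) * 2 - 1
X = labels[:, None] * theta + rng.standard_normal((n, d))

# Projected gradient descent over a Euclidean ball (radius chosen ad hoc).
w = 0.1 * rng.standard_normal(d)
for _ in range(500):
    w = project_ball(w - 0.5 * entropy_risk_grad(w, X), 5.0)

# The minimizer should align (up to sign) with the separation vector.
cosine = abs(w @ theta) / (np.linalg.norm(w) * np.linalg.norm(theta))
```

Note that the entropy risk is maximized at $w = 0$ (probabilities at $1/2$), so gradient descent escapes the origin along the leading direction of the mixture covariance, which is the separation direction; the ball constraint then plays the role of the regularization discussed in the abstract.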