使用小元件分析来打猎模式 (Mode Hunting Using Pettiest Components Analysis) - 专知论文

会员服务 ·

0

可约的 · 规范化的 · 峰值 · Better · PCA ·

2021 年 1 月 12 日

Mode Hunting Using Pettiest Components Analysis

翻译：使用小元件分析来打猎模式

Tianhao Liu,Daniel Andrés Díaz-Pachón,J. Sunil Rao,Jean-Eudes Dazard

from arxiv, 10 pages, 2 tables, 3 figures

Principal component analysis has been used to reduce dimensionality of datasets for a long time. In this paper, we will demonstrate that in mode detection the components of smallest variance, the pettiest components, are more important. We prove that when the data follows a multivariate normal distribution, by implementing "pettiest component analysis" when the data is normally distributed, we obtain boxes of optimal size in the sense that their size is minimal over all possible boxes with the same number of dimensions and given probability. We illustrate our result with a simulation revealing that pettiest component analysis works better than its competitors.

翻译：长期以来,主要元件分析被用来减少数据集的维度。在本文中,我们将证明,在模式检测中,最小差异的元件、小元件组件更为重要。我们证明,当数据遵循多变量正常分布时,通过在数据通常分布时进行“最小元件分析”,我们获得最佳尺寸的盒子,其含义是,所有可能的盒体大小最小,尺寸和概率相同。我们用模拟来说明我们的结果,显示毛件分析比其竞争者效果更好。

0

相关内容

可约的

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

应用机器学习书稿，361页pdf

应用机器学习书稿，361页pdf

专知会员服务

59+阅读 · 2020年11月24日

【干货书】Python高级数据科学分析，424页pdf

【干货书】Python高级数据科学分析，424页pdf

专知会员服务

117+阅读 · 2020年8月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

专知会员服务

13+阅读 · 2019年8月27日

已删除

将门创投

6+阅读 · 2019年7月11日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

Data-driven topology design using a deep generative model

Arxiv

0+阅读 · 2021年3月9日

A Case Study of Onboarding in Software Teams: Tasks and Strategies

Arxiv

0+阅读 · 2021年3月8日

Large-Sample Properties of Blind Estimation of the Linear Discriminant Using Projection Pursuit

Arxiv

0+阅读 · 2021年3月8日

Asymptotics of Ridge (less) Regression under General Source Condition

Arxiv

0+阅读 · 2021年3月8日

Asymptotics of Ridge Regression in Convolutional Models

Arxiv

0+阅读 · 2021年3月8日

Uncovering the Benefits and Challenges of Continuous Integration Practices

Arxiv

0+阅读 · 2021年3月7日

Some Properties and Applications of Burr III-Weibull Distribution

Some Properties and Applications of Burr III-Weibull Distribution

Arxiv

0+阅读 · 2021年3月5日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Arxiv

4+阅读 · 2018年11月5日

Approximability of Discriminators Implies Diversity in GANs

Approximability of Discriminators Implies Diversity in GANs

Arxiv

4+阅读 · 2018年6月27日

VIP会员

文章信息

相关主题

相关VIP内容

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

应用机器学习书稿，361页pdf

应用机器学习书稿，361页pdf

专知会员服务

59+阅读 · 2020年11月24日

【干货书】Python高级数据科学分析，424页pdf

【干货书】Python高级数据科学分析，424页pdf

专知会员服务

117+阅读 · 2020年8月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

专知会员服务

13+阅读 · 2019年8月27日

热门VIP内容

开通专知VIP会员享更多权益服务

赋能真实世界：基于大语言模型的产业智能体技术、实践与评测综述

军事行动中人工智能系统目标交战的附带损伤评估模型 | 最新文献

【普林斯顿博士论文】面向人本机器人学的安全与学习博弈论融合

美陆军协会（AUSA）2025 年会公布的美国十大武器与防务产品创新

相关资讯

已删除

将门创投

6+阅读 · 2019年7月11日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Data-driven topology design using a deep generative model

Arxiv

0+阅读 · 2021年3月9日

A Case Study of Onboarding in Software Teams: Tasks and Strategies

Arxiv

0+阅读 · 2021年3月8日

Large-Sample Properties of Blind Estimation of the Linear Discriminant Using Projection Pursuit

Arxiv

0+阅读 · 2021年3月8日

Asymptotics of Ridge (less) Regression under General Source Condition

Arxiv

0+阅读 · 2021年3月8日

Asymptotics of Ridge Regression in Convolutional Models

Arxiv

0+阅读 · 2021年3月8日

Uncovering the Benefits and Challenges of Continuous Integration Practices

Arxiv

0+阅读 · 2021年3月7日

Some Properties and Applications of Burr III-Weibull Distribution

Some Properties and Applications of Burr III-Weibull Distribution

Arxiv

0+阅读 · 2021年3月5日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Arxiv

4+阅读 · 2018年11月5日

Approximability of Discriminators Implies Diversity in GANs

Approximability of Discriminators Implies Diversity in GANs

Arxiv

4+阅读 · 2018年6月27日

微信扫码咨询专知VIP会员