异种共变量模型的双重数据叠加 (Double Data Piling for Heterogeneous Covariance Models) - 专知论文

会员服务 ·

0

相互独立的 · 极大 · MoDELS · 测试数据 · 有向 ·

2022 年 11 月 28 日

Double Data Piling for Heterogeneous Covariance Models

翻译：异种共变量模型的双重数据叠加

Taehyun Kim,Jeongyoun Ahn,Sungkyu Jung

In this work, we characterize two data piling phenomenon for a high-dimensional binary classification problem with heterogeneous covariance models. The data piling refers to the phenomenon where projections of the training data onto a direction vector have exactly two distinct values, one for each class. This first data piling phenomenon occurs for any data when the dimension $p$ is larger than the sample size $n$. We show that the second data piling phenomenon, which refers to a data piling of independent test data, can occur in an asymptotic context where $p$ grows while $n$ is fixed. We further show that a second maximal data piling direction, which gives an asymptotic maximal distance between the two piles of independent test data, can be obtained by projecting the first maximal data piling direction onto the nullspace of the common leading eigenspace. This observation provides a theoretical explanation for the phenomenon where the optimal ridge parameter can be negative in the context of high-dimensional linear classification. Based on the second data piling phenomenon, we propose various linear classification rules which ensure perfect classification of high-dimension low-sample-size data under generalized heterogeneous spiked covariance models.

翻译：在这项工作中,我们用多种共差模型为高维的二进制分类问题确定两个数据堆积现象。数据堆积是指向方向矢量上的培训数据预测有两个截然不同的值, 每类一个。第一个数据堆积现象发生在任何数据中, 当维维维值$p$大于样本大小时。我们显示第二个数据堆积现象, 指独立测试数据的一个数据堆积数据, 可能发生在一个零星环境中, 即美元增长而美元固定。我们进一步显示, 第二个最大数据堆积方向, 给两个独立测试数据堆之间带来一个无同步的最大距离, 可以通过预测第一个最大数据堆积方向与共同导导出电子空间的空格。我们的观察为在高度线性线性分类中, 最佳脊柱参数可能为负数的现象提供了理论解释。基于第二个数据堆积现象, 我们提出各种线性分类规则, 以确保高二进制的低等同度数据质化模型的完美分类。

0

相关内容

相互独立的

相互独立的

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Twitter大佬在线讲座：GNN through the Lens of Curvature

Twitter大佬在线讲座：GNN through the Lens of Curvature

图与推荐

1+阅读 · 2022年4月12日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

非凸稀疏正则化模型与算法的研究

国家自然科学基金

3+阅读 · 2015年12月31日

赤桉ICE1调控低温胁迫响应的分子机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

Foxl2在三疣梭子蟹卵巢发育中的作用及其机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Crif1调控Nrf2-ARE信号通路促进BMSCs抗辐射损伤机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

遗传相互作用网络的构建和分析

国家自然科学基金

1+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

高光度blazar的甚高能伽马射线辐射研究

国家自然科学基金

0+阅读 · 2009年12月31日

针刺抗氧化效应的TRx氧化还原调控机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning

Arxiv

0+阅读 · 2023年1月27日

A Deep Learning Method for Comparing Bayesian Hierarchical Models

Arxiv

0+阅读 · 2023年1月27日

Diverse Weight Averaging for Out-of-Distribution Generalization

Arxiv

0+阅读 · 2023年1月27日

Optimally-Weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference

Arxiv

0+阅读 · 2023年1月27日

Variable Selection for Doubly Robust Causal Inference

Arxiv

0+阅读 · 2023年1月26日

Graph-based Recommendation for Sparse and Heterogeneous User Interactions

Arxiv

0+阅读 · 2023年1月26日

Proximal Causal Learning of Heterogeneous Treatment Effects

Arxiv

0+阅读 · 2023年1月26日

Statistical Inference and Large-scale Multiple Testing for High-dimensional Regression Models

Arxiv

0+阅读 · 2023年1月25日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Arxiv

19+阅读 · 2020年12月17日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

提升军事训练能力的最佳人工智能模拟工具

《社交媒体信息作战》最新48页技术报告

相关资讯

Twitter大佬在线讲座：GNN through the Lens of Curvature

Twitter大佬在线讲座：GNN through the Lens of Curvature

图与推荐

1+阅读 · 2022年4月12日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning

Arxiv

0+阅读 · 2023年1月27日

A Deep Learning Method for Comparing Bayesian Hierarchical Models

Arxiv

0+阅读 · 2023年1月27日

Diverse Weight Averaging for Out-of-Distribution Generalization

Arxiv

0+阅读 · 2023年1月27日

Optimally-Weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference

Arxiv

0+阅读 · 2023年1月27日

Variable Selection for Doubly Robust Causal Inference

Arxiv

0+阅读 · 2023年1月26日

Graph-based Recommendation for Sparse and Heterogeneous User Interactions

Arxiv

0+阅读 · 2023年1月26日

Proximal Causal Learning of Heterogeneous Treatment Effects

Arxiv

0+阅读 · 2023年1月26日

Statistical Inference and Large-scale Multiple Testing for High-dimensional Regression Models

Arxiv

0+阅读 · 2023年1月25日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Arxiv

19+阅读 · 2020年12月17日

相关基金

非凸稀疏正则化模型与算法的研究

国家自然科学基金

3+阅读 · 2015年12月31日

赤桉ICE1调控低温胁迫响应的分子机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

Foxl2在三疣梭子蟹卵巢发育中的作用及其机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Crif1调控Nrf2-ARE信号通路促进BMSCs抗辐射损伤机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

遗传相互作用网络的构建和分析

国家自然科学基金

1+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

高光度blazar的甚高能伽马射线辐射研究

国家自然科学基金

0+阅读 · 2009年12月31日

针刺抗氧化效应的TRx氧化还原调控机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员