Large language models have become ubiquitous in modern life, with applications in domains such as natural language processing, language translation, and speech recognition. A recent breakthrough [Zhao, Panigrahi, Ge, and Arora, arXiv 2023] explains the attention model via probabilistic context-free grammars (PCFGs). One of the central computational tasks in computing probabilities in a PCFG can be formulated as a particular tensor low-rank approximation problem, which we call tensor cycle rank. Given an $n \times n \times n$ third-order tensor $A$, we say that $A$ has cycle rank $k$ if there exist three $n \times k^2$ matrices $U$, $V$, and $W$ such that \begin{align*} A_{a,b,c} = \sum_{i=1}^k \sum_{j=1}^k \sum_{l=1}^k U_{a,\, i+k(j-1)} \cdot V_{b,\, j + k(l-1)} \cdot W_{c,\, l + k(i-1) } \end{align*} for all $a, b, c \in [n]$; the three index pairs $(i,j)$, $(j,l)$, $(l,i)$ chase each other in a cycle, hence the name. Low-rank approximation under the classical tensor rank, Tucker rank, and tensor train rank is well studied [Song, Woodruff, Zhong SODA 2019]. In this paper, we generalize the ``rotation and sketch'' technique (page 186 of [Song, Woodruff, Zhong SODA 2019]) and give an input-sparsity-time algorithm for cycle rank.
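To make the index bookkeeping in the definition concrete, the following is a minimal NumPy sketch (not the paper's algorithm): it builds a cycle rank-$k$ tensor from factor matrices $U$, $V$, $W$ and spot-checks one entry against the definition. The reshape convention and the random factors are assumptions chosen purely for illustration.
\begin{verbatim}
# Minimal sketch: construct a cycle rank-k tensor from n x k^2
# factors U, V, W per the definition
#   A[a,b,c] = sum_{i,j,l} U[a,i+k(j-1)] V[b,j+k(l-1)] W[c,l+k(i-1)]
# (written here 0-indexed). Illustration only, not the paper's algorithm.
import numpy as np

n, k = 6, 2
rng = np.random.default_rng(0)
U, V, W = (rng.standard_normal((n, k * k)) for _ in range(3))

# With 0-indexed column i + k*j, a C-order reshape to (n, k, k)
# puts j on the middle axis and i on the last axis.
U3 = U.reshape(n, k, k)  # U3[a, j, i] = U[a, i + k*j]
V3 = V.reshape(n, k, k)  # V3[b, l, j] = V[b, j + k*l]
W3 = W.reshape(n, k, k)  # W3[c, i, l] = W[c, l + k*i]

# The cyclic pattern (i,j), (j,l), (l,i) becomes one contraction:
A = np.einsum('aji,blj,cil->abc', U3, V3, W3)

# Spot-check one entry directly against the definition (0-indexed).
a, b, c = 1, 2, 3
direct = sum(U[a, i + k * j] * V[b, j + k * l] * W[c, l + k * i]
             for i in range(k) for j in range(k) for l in range(k))
assert np.isclose(A[a, b, c], direct)
\end{verbatim}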