In this paper we study the approximate minimization problem for language modelling. We assume we are given some language model as a black box. The objective is to obtain a weighted finite automaton (WFA) that fits within a given size constraint and that mimics the behaviour of the original model, while minimizing some notion of distance between the black box and the extracted WFA. We provide an algorithm for the approximate minimization of black boxes trained for language modelling of sequential data over a one-letter alphabet. By reformulating the problem in terms of Hankel matrices, we leverage classical results on the approximation of Hankel operators, namely the celebrated Adamyan-Arov-Krein (AAK) theory. This allows us to use the spectral norm to measure the distance between the black box and the WFA. We provide theoretical guarantees for handling the potentially infinite-rank Hankel matrix of the black box without accessing the training data, and we prove that our method returns an asymptotically-optimal approximation.
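To make the Hankel reformulation concrete: over a one-letter alphabet, a string is determined by its length, so querying the black box for the weight $f(n)$ of the length-$n$ string yields a Hankel matrix $H_{i,j} = f(i+j)$. The sketch below (not the paper's algorithm, just an illustration of the underlying principle) builds a finite truncation of this matrix for a hypothetical rational black box and compares it with its best rank-$k$ approximation in spectral norm; by the Eckart-Young theorem that error is the $(k{+}1)$-th singular value, the same value AAK theory attains with a rank-$k$ Hankel approximant.

```python
import numpy as np

# Hypothetical black box over a one-letter alphabet: f(n) is the weight
# assigned to the unique string of length n. This mixture of geometric
# series gives a Hankel matrix of rank 3 (illustrative choice only).
def f(n):
    return 0.4 * 0.5**n + 0.3 * 0.3**n + 0.2 * 0.8**n

N = 64  # finite truncation of the (in general infinite) Hankel matrix
H = np.array([[f(i + j) for j in range(N)] for i in range(N)])

# Best rank-k approximation via truncated SVD; its spectral-norm error
# equals the (k+1)-th singular value (Eckart-Young), which is also the
# error AAK theory guarantees for the best rank-k Hankel approximant.
k = 2
U, s, Vt = np.linalg.svd(H)
H_k = U[:, :k] * s[:k] @ Vt[:k]

err = np.linalg.norm(H - H_k, 2)
print(err, s[k])  # the two coincide
```

Note that the truncated SVD of $H$ is generally not itself a Hankel matrix; the point of AAK theory is that an equally good approximant *within the Hankel class* (hence realizable by a small WFA) still exists.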