What is the relationship between linguistic dependencies and statistical dependence? Building on earlier work in NLP and cognitive science, we study this question. We introduce a contextualized version of pointwise mutual information (CPMI), using pretrained language models to estimate probabilities of words in context. Extracting dependency trees that maximize CPMI, we compare the resulting structures against gold dependencies. Overall, we find that these maximum-CPMI trees correspond to linguistic dependencies more often than trees extracted from a non-contextual PMI estimate, but only roughly as often as a simple baseline formed by connecting adjacent words. We also provide evidence that the extent to which the two kinds of dependency align cannot be explained by the distance between words or by the category of the dependency relation. Finally, our analysis sheds some light on the differences between large pretrained language models, specifically in the kinds of inductive biases they encode.
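To make the procedure concrete, here is a minimal sketch of CPMI estimation and maximum-CPMI tree extraction. It assumes a masked language model (bert-base-cased via HuggingFace transformers) as the probability estimator and takes CPMI(w_i; w_j) to be the difference in the log-probability of w_i when w_j is visible versus masked; the one-wordpiece-per-word treatment, averaging-based symmetrization, and model choice are illustrative assumptions, not the paper's exact setup.

```python
# Sketch: CPMI matrix from a masked LM, then a maximum spanning tree.
# Assumptions (labeled, not from the abstract): BERT as the estimator,
# CPMI(i; j) = log p(w_i | s with i masked) - log p(w_i | s with i, j masked),
# one wordpiece per word, scores symmetrized by averaging.
import numpy as np
import torch
from scipy.sparse.csgraph import minimum_spanning_tree
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-cased").eval()


def masked_log_prob(input_ids, target, masked_positions):
    """Log-probability of the original token at `target` after masking
    every position in `masked_positions` (which includes `target`)."""
    ids = input_ids.clone()
    ids[0, list(masked_positions)] = tokenizer.mask_token_id
    with torch.no_grad():
        logits = model(ids).logits
    log_probs = torch.log_softmax(logits[0, target], dim=-1)
    return log_probs[input_ids[0, target]].item()


def cpmi_matrix(sentence):
    """CPMI score for every ordered pair of token positions,
    excluding the [CLS]/[SEP] special tokens."""
    input_ids = tokenizer(sentence, return_tensors="pt").input_ids
    n = input_ids.shape[1]
    scores = np.zeros((n, n))
    for i in range(1, n - 1):
        base = masked_log_prob(input_ids, i, [i])  # w_j visible
        for j in range(1, n - 1):
            if i != j:  # w_j masked as well
                scores[i, j] = base - masked_log_prob(input_ids, i, [i, j])
    return scores[1:-1, 1:-1]  # drop special-token rows/columns


def max_cpmi_tree(sentence):
    """Edges of the undirected spanning tree maximizing total CPMI."""
    sym = cpmi_matrix(sentence)
    sym = (sym + sym.T) / 2
    # SciPy only ships a *minimum* spanning tree, so flip the sign and
    # shift so all edge weights are strictly positive (SciPy treats
    # zeros in a dense matrix as missing edges).
    weights = sym.max() - sym + 1.0
    np.fill_diagonal(weights, 0.0)
    mst = minimum_spanning_tree(weights)
    return [(int(i), int(j)) for i, j in zip(*mst.nonzero())]


print(max_cpmi_tree("The kids ran to the park"))
```

The sign flip plus constant shift preserves the optimal tree, since every spanning tree over n nodes has exactly n − 1 edges; the resulting edge list can then be scored against gold dependencies, or against the adjacent-word baseline mentioned above.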