It is a common belief in the NLP community that continuous bag-of-words (CBOW) word embeddings tend to underperform skip-gram (SG) embeddings. We find that this belief is founded less on theoretical differences in their training objectives than on faulty CBOW implementations in standard software libraries such as the official word2vec.c implementation and Gensim. We show that our correct implementation of CBOW yields word embeddings that are fully competitive with SG on various intrinsic and extrinsic tasks while being more than three times as fast to train. We release our implementation, kōan, at https://github.com/bloomberg/koan.
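The abstract does not spell out the implementation flaw. A common account of the discrepancy, which this paper examines, is that CBOW averages the context vectors in the forward pass, so the backward pass must divide the context-vector gradient by the number of context words; the faulty implementations apply the full gradient to each context vector instead. The NumPy sketch below illustrates one corrected CBOW negative-sampling update under that assumption; all names (`cbow_update`, `W_in`, `W_out`, `lr`) are illustrative and are not the kōan API.

```python
# Minimal sketch (not the paper's code) of one CBOW negative-sampling
# SGD step, with the context gradient scaled by 1/|context| to match
# the averaging done in the forward pass.
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def cbow_update(W_in, W_out, context_ids, target_id, negative_ids, lr=0.025):
    """One SGD step for CBOW with negative sampling.

    W_in:  (vocab, dim) input/context embedding matrix
    W_out: (vocab, dim) output embedding matrix
    """
    C = len(context_ids)
    h = W_in[context_ids].mean(axis=0)  # forward pass: average of context vectors

    # Accumulate the gradient w.r.t. the averaged hidden vector h over the
    # positive target and the sampled negatives.
    grad_h = np.zeros_like(h)
    for out_id, label in [(target_id, 1.0)] + [(n, 0.0) for n in negative_ids]:
        g = sigmoid(W_out[out_id] @ h) - label  # d(loss)/d(score)
        grad_h += g * W_out[out_id]             # use pre-update output vector
        W_out[out_id] -= lr * g * h

    # Corrected backward pass: because h is an average, each context vector
    # receives grad_h / C. The faulty versions omit the division by C.
    # (Assumes context_ids contains no duplicates; NumPy fancy-index updates
    # would apply only once per repeated index.)
    W_in[context_ids] -= lr * grad_h / C


# Tiny usage example with random embeddings.
rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(1000, 100))
W_out = np.zeros((1000, 100))
cbow_update(W_in, W_out, context_ids=[3, 17, 42, 8],
            target_id=5, negative_ids=[9, 200, 77])
```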