In recent years, there has been a resurgence of interest in non-autoregressive text generation in the context of general language modeling. Unlike the well-established autoregressive language modeling paradigm, which enjoys a plethora of standard training and inference libraries, implementations of non-autoregressive language modeling have largely been bespoke, making it difficult to perform systematic comparisons of different methods. Moreover, each non-autoregressive language model typically requires its own data collation, loss, and prediction logic, which makes it challenging to reuse common components. In this work, we present the XLM Python package, which is designed to make implementing small non-autoregressive language models faster, with the secondary goal of providing a suite of small pre-trained models (through a companion xlm-models package) that can be used by the research community. The code is available at https://github.com/dhruvdcoder/xlm-core.
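To illustrate the kind of component reuse described above, the following is a minimal, hypothetical sketch and not the actual xlm-core API: it assumes a single base class, here called NonAutoregressiveLM, that bundles the three pieces that tend to be bespoke per method (data collation, loss, and prediction) behind one interface, so that a generic training step can be shared across methods.

```python
# Hypothetical sketch (not the actual xlm-core API): one way a shared
# interface could decouple method-specific collation, loss, and prediction
# logic from a common training loop.
from abc import ABC, abstractmethod
from typing import Any, Dict, List

import torch


class NonAutoregressiveLM(ABC):
    """Hypothetical base class bundling the logic that is typically
    bespoke per non-autoregressive method."""

    @abstractmethod
    def collate(self, examples: List[Dict[str, Any]]) -> Dict[str, torch.Tensor]:
        """Turn raw tokenized examples into a padded, masked batch."""

    @abstractmethod
    def loss(self, batch: Dict[str, torch.Tensor]) -> torch.Tensor:
        """Compute the method-specific training loss for a batch."""

    @abstractmethod
    def predict(self, batch: Dict[str, torch.Tensor]) -> torch.Tensor:
        """Run the method-specific (possibly iterative) decoding procedure."""


def train_step(
    model: NonAutoregressiveLM,
    optimizer: torch.optim.Optimizer,
    examples: List[Dict[str, Any]],
) -> float:
    """A generic training step that works for any model implementing the
    interface, so the loop itself can be shared across methods."""
    batch = model.collate(examples)
    optimizer.zero_grad()
    loss = model.loss(batch)
    loss.backward()
    optimizer.step()
    return loss.item()
```

Under this kind of design, a systematic comparison of two methods reduces to swapping one NonAutoregressiveLM implementation for another while holding the training loop, data pipeline, and evaluation code fixed.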