BanglaNLG:评估孟加拉低资源自然语言生成的基准和资源 (BanglaNLG: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla)

This work presents BanglaNLG, a comprehensive benchmark for evaluating natural language generation (NLG) models in Bangla, a widely spoken yet low-resource language in the web domain. We aggregate three challenging conditional text generation tasks under the BanglaNLG benchmark. Then, using a clean corpus of 27.5 GB of Bangla data, we pretrain BanglaT5, a sequence-to-sequence Transformer model for Bangla. BanglaT5 achieves state-of-the-art performance in all of these tasks, outperforming mT5 (base) by up to 5.4%. We are making the BanglaT5 language model and a leaderboard publicly available in the hope of advancing future research and evaluation on Bangla NLG. The resources can be found at https://github.com/csebuetnlp/BanglaNLG.

翻译：这项工作提出了BanglaNLG(BanglaNLG)模型,该模型是评价孟加拉语的自然语言生成(NLG)模型的一个全面基准,孟加拉语是网络域中广泛使用但资源较少的语言。我们根据BanglaNLG基准汇总了三项具有挑战性的有条件文本生成任务。然后,我们利用27.5GB孟加拉语数据这一干净的集合体,对BanglaT5(孟加拉语的序列到序列转换模型)进行了准备。BanglaT5(孟加拉语)在所有这些任务中都取得了最先进的表现,比MT5(基础)高5.4%。我们正在将BanglaT5语言模型和一个领导板公开提供,以期推进Bangla NLG的未来研究和评价。这些资源可以在https://github.com/csebuetnp/BanglaNLG上找到。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【NLP模型压缩方法综述】《A Survey of Methods for Model Compression in NLP》by Madison May

专知会员服务

43+阅读 · 2020年4月22日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日