Chinese Grammatical Error Correction Based on Knowledge Distillation - 专知论文


August 5, 2022

Chinese grammatical error correction based on knowledge distillation


Peng Xia, Yuechi Zhou, Ziyan Zhang, Zecheng Tang, Juntao Li

From arXiv: The paper needs to be withdrawn at my advisor's request. We will submit a new version after revising it and translating it into English so that it can be read more widely.

Existing Chinese grammatical error correction models are not robust on attack test sets and have large parameter counts. To address both problems, this paper uses knowledge distillation to compress the model and improve its resistance to attacks. On the data side, the attack test set is constructed by injecting perturbations into the standard evaluation data set, and model robustness is evaluated on this attack test set. Experimental results show that the distilled small model maintains performance and trains faster while using fewer parameters, achieves the best results on the attack test set, and is significantly more robust.
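The abstract does not specify the distillation objective or the kind of perturbation used, so the following is only a minimal sketch under stated assumptions. It pairs a commonly used soft-target distillation loss (temperature-scaled KL divergence blended with hard-label cross-entropy) with one hypothetical perturbation, an adjacent-character swap, for building an attack set. The function names, the `temperature` and `alpha` values, and the swap perturbation are all illustrative and are not taken from the paper.

```python
import math
import random


def log_softmax(logits, temperature=1.0):
    """Temperature-scaled log-softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    log_total = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(student_logits, teacher_logits, gold_index,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target KL term with hard-label cross-entropy.

    temperature and alpha are illustrative values (assumptions).
    """
    teacher_log = log_softmax(teacher_logits, temperature)
    student_log = log_softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions
    kl = sum(math.exp(t) * (t - s) for t, s in zip(teacher_log, student_log))
    # Cross-entropy of the student against the gold token at temperature 1
    ce = -log_softmax(student_logits)[gold_index]
    # The T^2 factor keeps the soft-term gradients on a comparable scale
    return alpha * temperature ** 2 * kl + (1 - alpha) * ce


def swap_adjacent_chars(sentence, rng):
    """Hypothetical perturbation: swap one randomly chosen adjacent pair."""
    if len(sentence) < 2:
        return sentence
    i = rng.randrange(len(sentence) - 1)
    chars = list(sentence)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


def build_attack_set(sentences, seed=0):
    """Perturb every source sentence of an evaluation set, reproducibly."""
    rng = random.Random(seed)
    return [swap_adjacent_chars(s, rng) for s in sentences]
```

In this sketch, a student whose logits match the teacher's incurs no soft-target penalty, and fixing the seed makes the attack set reproducible, so different models can be compared on identical perturbed inputs.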

