Educational artificial intelligence aims to benefit tasks in the education domain, such as intelligent test-paper generation and consolidation exercises, whose core technique is matching exercises, known as the finding-similar-exercises (FSE) problem. Most existing approaches emphasize the model's ability to represent exercises; unfortunately, many challenges remain, such as data scarcity, insufficient understanding of exercises, and high label noise. We release BERT$_{Edu}$, a Chinese educational pre-trained language model for the label-scarce setting, and introduce exercise normalization to overcome the diversity of mathematical formulas and terms in exercises. We design new auxiliary tasks in an innovative way based on problem-solving ideas and propose a highly effective MoE-enhanced multi-task model for the FSE task to attain a better understanding of exercises. In addition, confident learning is utilized to prune the training set and overcome the high noise in the labeled data. Experiments show that the methods proposed in this paper are very effective.
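The label-pruning step mentioned above can be illustrated with a minimal, self-contained sketch of the core confident-learning idea: compare each example's out-of-sample predicted probabilities against per-class confidence thresholds and drop examples whose label disagrees with a confidently predicted other class. All data, array names, and thresholds here are hypothetical toy values, not the paper's actual pipeline.

```python
import numpy as np

# Hypothetical toy data: 6 exercise pairs, 2 classes (0 = not similar, 1 = similar).
# pred_probs would come from a cross-validated model, so probabilities are
# out-of-sample for each example.
labels = np.array([1, 1, 0, 0, 1, 0])
pred_probs = np.array([
    [0.1, 0.9],
    [0.8, 0.2],   # labeled 1, but the model is confident it is 0 -> likely noise
    [0.7, 0.3],
    [0.6, 0.4],
    [0.2, 0.8],
    [0.3, 0.7],   # labeled 0, but the model is confident it is 1 -> likely noise
])

# Per-class confidence threshold: the mean self-confidence of examples
# carrying that label (the core quantity in confident learning).
n_classes = pred_probs.shape[1]
thresholds = np.array([
    pred_probs[labels == c, c].mean() for c in range(n_classes)
])

# Flag an example as a likely label error if its predicted probability for
# some OTHER class meets that class's threshold.
suspect = np.array([
    any(pred_probs[i, c] >= thresholds[c]
        for c in range(n_classes) if c != labels[i])
    for i in range(len(labels))
])

# Keep only the confidently labeled examples for training.
clean_idx = np.where(~suspect)[0]
print(clean_idx)  # the two planted noisy examples (indices 1 and 5) are pruned
```

In practice this filtering is typically done with cross-validated predictions over the full training set (e.g. via the `cleanlab` library), so that no example's probabilities come from a model that saw its own, possibly wrong, label.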