Designing dialog tutors is challenging because it requires modeling the diverse and complex pedagogical strategies employed by human tutors. Despite significant recent advances in neural conversational systems built on large language models (LLMs) and growth in available dialog corpora, dialog tutoring has largely remained unaffected by these advances. In this paper, we rigorously analyze various generative language models on two dialog tutoring datasets for language learning, using automatic and human evaluations, to understand both the new opportunities these advances bring and the challenges we must overcome to build models usable in real educational settings. We find that although current approaches can model tutoring in constrained learning scenarios, where the number of concepts to be taught and the set of possible teacher strategies are small, they perform poorly in less constrained scenarios. Our human quality evaluation shows that both model outputs and ground-truth annotations score low on equitable tutoring, which measures the learning opportunities offered to students and how engaging the dialog is. To understand the behavior of our models in a real tutoring setting, we conduct a user study with expert annotators and find a substantial number of model reasoning errors, which appear in 45% of conversations. Finally, we synthesize our findings to outline directions for future work.