Throughout schooling, students are tested on reading comprehension and logical reasoning. Students have developed various strategies for completing such exams, some of which are generally thought to outperform others. One such strategy emphasizes relative accuracy over absolute accuracy and can, in principle, produce the correct answer without full knowledge of the information required to solve the question. This paper examines the effectiveness of applying such a strategy to train transfer learning models to solve reading comprehension and logical reasoning questions. The models were evaluated on the ReClor dataset, a challenging reading comprehension and logical reasoning benchmark. While previous studies targeted logical reasoning skills, we focus on a general training method and model architecture. We propose the polytuplet loss function, an extension of the triplet loss function, to prioritize learning the relative correctness of answer choices over learning the absolute correctness of each choice. Our results indicate that models employing polytuplet loss outperform existing baseline models. Although polytuplet loss is a promising alternative to other contrastive loss functions, further research is required to quantify the benefits it may present.
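To make the "relative over absolute correctness" idea concrete, the following is a minimal sketch of a polytuplet-style loss in the spirit described above. It is a hypothetical illustration, not the paper's exact formulation: it assumes one anchor embedding (the context/question), one positive embedding (the correct answer), and several negative embeddings (the incorrect answers), and generalizes the triplet hinge loss by summing the margin violation over all negatives. The function name `polytuplet_loss`, the Euclidean distance, and the margin value are all assumptions made for illustration.

```python
import numpy as np

def polytuplet_loss(anchor, positive, negatives, margin=1.0):
    """Illustrative multi-negative extension of triplet loss (assumed form).

    Penalizes each incorrect choice whose embedding is not at least
    `margin` farther from the anchor than the correct choice is, so the
    model only needs to rank choices relative to one another.
    """
    d_pos = np.linalg.norm(anchor - positive)          # distance to correct answer
    d_negs = np.linalg.norm(negatives - anchor, axis=1)  # distances to wrong answers
    # Hinge term per negative: zero once the negative is pushed margin beyond the positive.
    return float(np.sum(np.maximum(0.0, margin + d_pos - d_negs)))

# Toy embeddings: the correct answer sits near the anchor, wrong answers far away.
anchor = np.array([0.0, 0.0])
positive = np.array([0.1, 0.0])
negatives = np.array([[3.0, 0.0], [0.0, 3.0], [-3.0, 0.0]])
print(polytuplet_loss(anchor, positive, negatives))  # well-separated choices give zero loss
```

Under this assumed form, the loss depends only on distance differences between answer choices, never on any absolute correctness score for a single choice, which is the ranking behavior the strategy above relies on.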