标题：组合式零样本领域转移及其在文本到文本模型中的应用摘要：对于专业领域而言，标签不足是提高任务性能的瓶颈。我们提出了一种新颖的组合式转移学习框架（DoT5 - Domain Compositional Zero-Shot T5）用于零样本领域转移。在没有访问专业领域标签的情况下，DoT5以多任务的方式联合学习领域知识（从未标记专业领域文本的MLM中学习）和任务知识（从更易获取的通用领域数据的任务训练中学习）。为了改善任务训练的可转移性，我们设计了一种名为NLGU的策略：同时训练NLG用于生成标签到数据的领域数据增益，以及NLU用于标签预测。我们针对生物医学领域和资源匮乏的放射学子领域进行了DoT5的评估，主要关注NLI、文本摘要和嵌入学习。通过多任务学习，DoT5展示了组合式转移学习的有效性。特别是，在RadNLI方面，DoT5的零样本转移表现超过SOTA超过7个绝对准确度点。我们通过消融和案例研究验证了DoT5，证明其能够解决需要专业领域知识的具有挑战性的NLI案例。 (Compositional Zero-Shot Domain Transfer with Text-to-Text Models)

翻译：标题：组合式零样本领域转移及其在文本到文本模型中的应用摘要：对于专业领域而言，标签不足是提高任务性能的瓶颈。我们提出了一种新颖的组合式转移学习框架（DoT5 - Domain Compositional Zero-Shot T5）用于零样本领域转移。在没有访问专业领域标签的情况下，DoT5以多任务的方式联合学习领域知识（从未标记专业领域文本的MLM中学习）和任务知识（从更易获取的通用领域数据的任务训练中学习）。为了改善任务训练的可转移性，我们设计了一种名为NLGU的策略：同时训练NLG用于生成标签到数据的领域数据增益，以及NLU用于标签预测。我们针对生物医学领域和资源匮乏的放射学子领域进行了DoT5的评估，主要关注NLI、文本摘要和嵌入学习。通过多任务学习，DoT5展示了组合式转移学习的有效性。特别是，在RadNLI方面，DoT5的零样本转移表现超过SOTA超过7个绝对准确度点。我们通过消融和案例研究验证了DoT5，证明其能够解决需要专业领域知识的具有挑战性的NLI案例。

Fangyu Liu,Qianchu Liu,Shruthi Bannur,Fernando Pérez-García,Naoto Usuyama,Sheng Zhang,Tristan Naumann,Aditya Nori,Hoifung Poon,Javier Alvarez-Valle,Ozan Oktay,Stephanie L. Hyland

from arxiv, Accepted at TACL, pre-MIT Press publication version. 16 pages, 4 figures

Label scarcity is a bottleneck for improving task performance in specialised domains. We propose a novel compositional transfer learning framework (DoT5 - domain compositional zero-shot T5) for zero-shot domain transfer. Without access to in-domain labels, DoT5 jointly learns domain knowledge (from MLM of unlabelled in-domain free text) and task knowledge (from task training on more readily available general-domain data) in a multi-task manner. To improve the transferability of task training, we design a strategy named NLGU: we simultaneously train NLG for in-domain label-to-data generation which enables data augmentation for self-finetuning and NLU for label prediction. We evaluate DoT5 on the biomedical domain and the resource-lean subdomain of radiology, focusing on NLI, text summarisation and embedding learning. DoT5 demonstrates the effectiveness of compositional transfer learning through multi-task learning. In particular, DoT5 outperforms the current SOTA in zero-shot transfer by over 7 absolute points in accuracy on RadNLI. We validate DoT5 with ablations and a case study demonstrating its ability to solve challenging NLI examples requiring in-domain expertise.

翻译：