Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling models to support any service in zero-shot through schemas, which describe service APIs to models in natural language. We explore the robustness of dialogue systems to linguistic variations in schemas by designing SGD-X - a benchmark extending SGD with semantically similar yet stylistically diverse variants for every schema. We observe that two top state tracking models fail to generalize well across schema variants, measured by joint goal accuracy and a novel metric for measuring schema sensitivity. Additionally, we present a simple model-agnostic data augmentation method to improve schema robustness.
翻译:“Schema-Guided Diaction”(SGD)数据集引入了一种模式,使模型能够支持通过Schemas零射(Schemas)为自然语言模型提供的API服务。我们通过设计SGD-X(SGD-X)这一基准,将SGD扩大为每个Schma(SGD)的语义相似但结构多样的变体。我们观察到,两个顶级的州跟踪模型无法对化学变体进行精确的概括,用联合目标精确度和测量化学灵敏度的新指标来衡量。此外,我们提出了一个简单的模型-不可知性数据增强方法,以改善化学的稳健性。