RPLKG：基于知识图谱的鲁棒提示学习 (RPLKG: Robust Prompt Learning with Knowledge Graph)

Large-scale pre-trained models have been known that they are transferable, and they generalize well on the unseen dataset. Recently, multimodal pre-trained models such as CLIP show significant performance improvement in diverse experiments. However, when the labeled dataset is limited, the generalization of a new dataset or domain is still challenging. To improve the generalization performance on few-shot learning, there have been diverse efforts, such as prompt learning and adapter. However, the current few-shot adaptation methods are not interpretable, and they require a high computation cost for adaptation. In this study, we propose a new method, robust prompt learning with knowledge graph (RPLKG). Based on the knowledge graph, we automatically design diverse interpretable and meaningful prompt sets. Our model obtains cached embeddings of prompt sets after one forwarding from a large pre-trained model. After that, model optimizes the prompt selection processes with GumbelSoftmax. In this way, our model is trained using relatively little memory and learning time. Also, RPLKG selects the optimal interpretable prompt automatically, depending on the dataset. In summary, RPLKG is i) interpretable, ii) requires small computation resources, and iii) easy to incorporate prior human knowledge. To validate the RPLKG, we provide comprehensive experimental results on few-shot learning, domain generalization and new class generalization setting. RPLKG shows a significant performance improvement compared to zero-shot learning and competitive performance against several prompt learning methods using much lower resources.

翻译：大规模预训练模型已经被证明具有可转移性，并且在未见过的数据集上具有很好的泛化性能。最近，多模态预训练模型（如CLIP）在各种实验中表现出了显着的性能改进。然而，当标记的数据集有限时，新数据集或新领域的泛化仍然具有挑战性。为了改善少样本学习的泛化性能，出现了各种各样的努力，例如提示学习和适配器。然而，当前的少样本自适应方法不可解释，并且需要高计算成本来进行自适应。在本研究中，我们提出了一种新方法——基于知识图谱的鲁棒提示学习（RPLKG）。基于知识图谱，我们自动设计了多样化的可解释和有意义的提示集。我们的模型通过从大型预训练模型进行一次转发来获取提示集的缓存嵌入。之后，模型使用GumbelSoftmax优化提示选择过程。通过这种方式，我们的模型使用相对较少的内存和学习时间进行训练。此外，RPLKG根据数据集自动选择最佳的可解释提示。总之，RPLKG具有以下特点：i）可解释性好；ii）需要的计算资源较少；iii）容易结合先前的人类知识。为了验证RPLKG的性能，我们在少样本学习、领域泛化和新类别泛化设置上提供了全面的实验结果。与零样本学习相比，RPLKG显示出了显着的性能提升，并且在使用更低资源的情况下与几种提示学习方法进行了竞争性能。