Visual Question Answering (VQA) using multi-modal data facilitates real-life applications, such as home robots and medical diagnoses. However, one significant challenge is to design a robust learning method for various client tasks. One critical aspect is to ensure privacy, as client data sharing is limited due to confidentiality concerns. This work focuses on addressing the issue of confidentiality constraints in multi-client VQA tasks and limited labeled training data of clients. We propose the Unidirectional Split Learning with Contrastive Loss (UniCon) method to overcome these limitations. The proposed method trains a global model on the entire data distribution of different clients, learning refined cross-modal representations through model sharing. Privacy is ensured by utilizing a split learning architecture in which a complete model is partitioned into two components for independent training. Moreover, recent self-supervised learning techniques were found to be highly compatible with split learning. This combination allows for rapid learning of a classification task without labeled data. Furthermore, UniCon integrates knowledge from various local tasks, improving knowledge sharing efficiency. Comprehensive experiments were conducted on the VQA-v2 dataset using five state-of-the-art VQA models, demonstrating the effectiveness of UniCon. The best-performing model achieved a competitive accuracy of 49.89%. UniCon provides a promising solution to tackle VQA tasks in a distributed data silo setting while preserving client privacy.
翻译:摘要:使用多模态数据进行视觉问答(VQA)可以促进现实生活应用,例如家用机器人和医疗诊断。然而,一个重要的挑战是为各种客户任务设计稳健的学习方法。一个重要的方面是确保隐私,由于机密性问题,客户数据共享受到限制。本文重点解决在多客户VQA任务和有限标记训练数据的情况下解决保密约束的问题。我们提出了一种名为Unidirectional Split Learning with Contrastive Loss (UniCon)的方法来克服这些限制。所提出的方法在不同客户的整个数据分布上训练全局模型,通过模型共享学习精细的跨模态表示。隐私通过利用分割学习体系结构来保证,其中完整模型被划分为独立训练的两个组件。此外,最近的自我监督学习技术被发现与分割学习高度兼容。此组合允许在没有标记数据的情况下快速学习分类任务。此外,UniCon集成了来自各种本地任务的知识,提高了知识分享的效率。在VQA-v2数据集上使用五种最先进的VQA模型进行了全面实验,证明了UniCon的有效性。最佳性能模型的准确率达到了49.89%。UniCon为在分布式数据隔板设置下处理VQA任务并保护客户隐私提供了有前途的解决方案。