Despite the great success achieved, existing video moment retrieval (VMR) methods are developed under the assumption that data are centralizedly stored. However, in real-world applications, due to the inherent nature of data generation and privacy concerns, data are often distributed on different silos, bringing huge challenges to effective large-scale training. In this work, we try to overcome above limitation by leveraging the recent success of federated learning. As the first that is explored in VMR field, the new task is defined as video moment retrieval with distributed data. Then, a novel federated learning method named FedVMR is proposed to facilitate large-scale and secure training of VMR models in decentralized environment. Experiments on benchmark datasets demonstrate its effectiveness. This work is the very first attempt to enable safe and efficient VMR training in decentralized scene, which is hoped to pave the way for further study in the related research field.
翻译:尽管取得了巨大成功,但现有的视频瞬间检索方法是在数据集中储存的假设下开发的,然而,在现实世界应用中,由于数据生成的固有性质和隐私问题,数据往往分布在不同的筒仓上,给有效的大规模培训带来了巨大的挑战。在这项工作中,我们试图通过利用最近联邦学习的成功来克服以上限制。作为在视频瞬间检索领域探索的第一项新任务,新任务被定义为视频瞬时检索,同时提供分布的数据。然后,建议采用名为FedVMR的新型联合学习方法,以促进分散环境中对VMR模型进行大规模和安全的培训。基准数据集实验证明了它的有效性。这是在分散环境中进行安全和高效的VMR培训的首次尝试,它有望为相关研究领域的进一步研究铺平道路。