Learning multiple domains/tasks with a single model is important for improving data efficiency and lowering inference cost for numerous vision tasks, especially on resource-constrained mobile devices. However, hand-crafting a multi-domain/task model can be both tedious and challenging. This paper proposes a novel approach to automatically learn a multi-path network for multi-domain visual classification on mobile devices. The proposed multi-path network is learned from neural architecture search by applying one reinforcement learning controller for each domain to select the best path in the super-network created from a MobileNetV3-like search space. An adaptive balanced domain prioritization algorithm is proposed to balance optimizing the joint model on multiple domains simultaneously. The determined multi-path model selectively shares parameters across domains in shared nodes while keeping domain-specific parameters within non-shared nodes in individual domain paths. This approach effectively reduces the total number of parameters and FLOPS, encouraging positive knowledge transfer while mitigating negative interference across domains. Extensive evaluations on the Visual Decathlon dataset demonstrate that the proposed multi-path model achieves state-of-the-art performance in terms of accuracy, model size, and FLOPS against other approaches using MobileNetV3-like architectures. Furthermore, the proposed method improves average accuracy over learning single-domain models individually, and reduces the total number of parameters and FLOPS by 78% and 32% respectively, compared to the approach that simply bundles single-domain models for multi-domain learning.
翻译:以单一模式学习多个域/任务,对于提高数据效率和降低许多愿景任务,特别是资源限制的移动设备的数据效率和降低推断成本十分重要。然而,手工制作多域/任务模型既乏味又具有挑战性。本文件提出一种新颖的办法,自动学习多路网络,用于移动设备多域视觉分类。拟议多路网络从神经结构搜索中学习,方法是对每个域应用一个强化学习控制器,以选择从类似移动NetV3搜索空间创建的超级网络中的最佳路径。提议采用适应性均衡域位排序算法,以平衡在多个域上优化联合模型。确定的多路模式在共享节点上有选择地分享跨域参数,同时将特定域参数保留在单个域路径上非共享的节点内。这一办法有效地减少了参数和FLOPS的总数,鼓励积极的知识转让,同时减少跨域的负面干扰。对视觉德曼基数据集的广泛评价显示,拟议的多路模式在单个域中实现了最先进的性业绩,分别使用了精确度、模型大小、最低路路路数据 和单个结构的完整学习方法,从而改进了Sloveal-lVOPS-al-lex-lex-laxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx