This work explores how to design a single neural network that is capable of adapting to multiple heterogeneous tasks of computer vision, such as image segmentation, 3D detection, and video recognition. This goal is challenging because network architecture designs in different tasks are inconsistent. We solve this challenge by proposing Network Coding Propagation (NCP), a novel "neural predictor", which is able to predict an architecture's performance in multiple datasets and tasks. Unlike prior arts of neural architecture search (NAS) that typically focus on a single task, NCP has several unique benefits. (1) NCP can be trained on different NAS benchmarks, such as NAS-Bench-201 and NAS-Bench-MR, which contains a novel network space designed by us for jointly searching an architecture among multiple tasks, including ImageNet, Cityscapes, KITTI, and HMDB51. (2) NCP learns from network codes but not original data, enabling it to update the architecture efficiently across datasets. (3) Extensive experiments evaluate NCP on object classification, detection, segmentation, and video recognition. For example, with 17\% fewer FLOPs, a single architecture returned by NCP achieves 86\% and 77.16\% on ImageNet-50-1000 and Cityscapes respectively, outperforming its counterparts. More interestingly, NCP enables a single architecture applicable to both image segmentation and video recognition, which achieves competitive performance on both HMDB51 and ADE20K compared to the singular counterparts. Code is available at https://github.com/dingmyu/NCP}{https://github.com/dingmyu/NCP.
翻译:这项工作探索如何设计一个单一的神经网络,能够适应计算机视觉的多种不同任务,如图像分割、3D检测和视频识别。这个目标具有挑战性,因为网络结构设计不同任务不一。我们提出网络编码演示(NCP),这是一个新型的“神经预测器”,能够预测建筑在多个数据集和任务方面的性能。与以往通常侧重于单一任务的神经结构搜索艺术(NAS)不同,NCP具有若干独特的好处。 (1) NCP可以就不同的NAS基准,如NAS-Bench-201和NAS-Bench-MR等对NCP进行培训,因为网络结构的设计具有挑战性。我们设计了一个新的网络空间,以联合搜索多个任务的结构,包括图像网络、城市风景、KITTI和HMDB51。 NCP从网络代码中学习但不是原始数据,使其能够在数据集中有效地更新结构。 (3) 广域实验对NCPC在对象分类、检测、分解和视频识别方面对NCP的NCP,例如17-K-B-NO/C在可应用的图像部分上,在18-LOP和单个图像上分别实现了一个可追溯的版本的图像识别。