State-of-the-art activity recognizers are effective during the day, but not trustworthy in the dark. The main causes are the distribution shift from the lower color contrast as well as the limited availability of labeled dark videos. Our goal is to recognize activities in the dark as well as in the day. To compensate for the lack of labeled dark videos, we introduce a pseudo-supervised learning scheme, which utilizes task-irrelevant unlabeled dark videos to train an activity recognizer. Our proposed activity recognizer makes use of audio which is invariant to illumination. However, the usefulness of audio and visual features differs according to the illumination. Thus we propose to make our audio-visual recognizer `darkness-aware'. Experiments on EPIC-Kitchens, Kinetics-Sound, and Charades demonstrate that our proposals enable effective activity recognition in the dark and can even improve robustness to occlusions.
翻译:最先进的活动识别器在白天有效,但在黑暗中不值得信赖。 主要原因是分布变化与低色对比的分布变化以及贴有标签的暗色视频的可用性有限。 我们的目标是识别在黑暗中和白天的活动。 为了弥补缺少贴有标签的暗色视频的情况,我们引入了一个伪监督的学习计划,该计划利用与任务相关的、没有标签的暗色视频来培训活动识别器。 我们拟议的活动识别器使用了不易被照明的音频。 然而,听觉和视觉特征的效用因照明而不同。 因此,我们建议制作我们的视听识别器“ darkness-aware ” 。 在EPIC-Kitchens, 动因子- Sounds 和 Charades 的实验表明,我们的建议能够让黑暗中的有效活动识别,甚至能够提高隐蔽的强度。