Light field saliency detection -- important due to utility in many vision tasks -- still lacks speed and can improve in accuracy. Due to the formulation of the saliency detection problem in light fields as a segmentation task or a memorizing task, existing approaches consume unnecessarily large amounts of computational resources for training, and have longer execution times for testing. We solve this by aggressively reducing the large light field images to a much smaller three-channel feature map appropriate for saliency detection using an RGB image saliency detector with attention mechanisms. We achieve this by introducing a novel convolutional neural network based features extraction and encoding module. Our saliency detector takes $0.4$ s to process a light field of size $9\times9\times512\times375$ in a CPU and is significantly faster than state-of-the-art light field saliency detectors, with better or comparable accuracy. Furthermore, model size of our architecture is significantly lower compared to state-of-the-art light field saliency detectors. Our work shows that extracting features from light fields through aggressive size reduction and the attention mechanism results in a faster and accurate light field saliency detector leading to near real-time light field processing.
翻译:光外显眼探测 -- -- 在许多视觉任务中由于实用性而很重要 -- -- 仍然缺乏速度,而且能够提高准确性。由于在光字段中作为分解任务或混合任务而提出突出的探测问题,现有方法消耗了不必要的大量计算资源用于培训,并有较长的测试执行时间。我们通过将大型光场图像刻录成一个小得多的三道特征地图来解决这一点,该地图适合于使用RGB图像显眼探测器和关注性机制进行突出检测。我们通过引入基于特征提取和编码模块的新型同步神经网络来实现这一目标。我们的显性探测器用0.4美元处理一个大小为9\times9\time512\time375的光字段,在CPU中处理一个大小为9\times512\time375美元的光域,比最先进的光场显眼探测器要快得多,精确性。此外,我们建筑的模型尺寸比最先进的光光场显眼探测器要低得多。我们的工作显示通过快速的缩小了光场的尺寸和关注机制从光场中提取特征。