Depth adjustment aims to enhance the visual experience of stereoscopic 3D (S3D) images, which accompanied with improving visual comfort and depth perception. For a human expert, the depth adjustment procedure is a sequence of iterative decision making. The human expert iteratively adjusts the depth until he is satisfied with the both levels of visual comfort and the perceived depth. In this work, we present a novel deep reinforcement learning (DRL)-based approach for depth adjustment named VCA-RL (Visual Comfort Aware Reinforcement Learning) to explicitly model human sequential decision making in depth editing operations. We formulate the depth adjustment process as a Markov decision process where actions are defined as camera movement operations to control the distance between the left and right cameras. Our agent is trained based on the guidance of an objective visual comfort assessment metric to learn the optimal sequence of camera movement actions in terms of perceptual aspects in stereoscopic viewing. With extensive experiments and user studies, we show the effectiveness of our VCA-RL model on three different S3D databases.
翻译:深度调整的目的是增加立体立体(S3D)图像的视觉体验,同时提高视觉舒适度和深度感知。对于一位人类专家来说,深度调整程序是迭代决策的顺序。人类专家反复调整深度,直到他对视觉舒适度和感知深度满意。在这项工作中,我们提出了一个以深度调整为新颖的深层强化学习(DRL)法,名为VCA-RL(视觉孔通识强化学习),以在深度编辑操作中明确模拟人类顺序决策。我们把深度调整过程作为Markov决策程序,将行动定义为控制左侧和右侧摄像头之间的距离的相机移动操作。我们的代理根据客观视觉舒适评估参数的指导,以了解立体观观中摄像活动的最佳顺序。我们通过广泛的实验和用户研究,在三个不同的S3D数据库中展示了我们的VCA-RL模型的有效性。