Recent research advances have significantly improved the visual realism of immersive 3D video communication. In this work we present a method to further enhance this immersive experience by adding a hand-touch capability ("remote hand clapping"). In our system, each meeting participant sits in front of a large screen with haptic feedback. The local participant can reach a hand out to the screen and clap hands with the remote participant as if the two were separated only by a virtual pane of glass. A key challenge in emulating remote hand touch is the realistic rendering of the participant's hand and arm as the hand touches the screen. When the hand is very close to the screen, the RGBD data required for realistic rendering are no longer available. To tackle this challenge, we present a dual representation of the user's hand. Our dual representation not only preserves the high-quality rendering usually found in recent image-based rendering systems but also allows the hand to reach the screen. This is possible because the dual representation includes both an image-based model and a 3D geometry-based model, with the latter driven by a hand skeleton tracked by a side-view camera. In addition, the dual representation provides a distance-based fusion of the image-based and 3D geometry-based models as the hand moves closer to the screen. As a result, the image-based and 3D geometry-based models mutually enhance each other, leading to realistic and seamless rendering. Our experiments demonstrate that our method provides a consistent hand-contact experience between remote users and improves the immersive experience of 3D video communication.
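The abstract does not state how the distance-based fusion is computed. As a purely illustrative sketch, one could blend the two renderings with a weight that depends on the hand-to-screen distance. Everything below is an assumption and not the authors' method: the function names `fusion_weight` and `fuse_pixel`, the smoothstep falloff, and the `near_m` and `far_m` thresholds are all hypothetical.

```python
def fusion_weight(distance_m, near_m=0.05, far_m=0.30):
    """Weight of the image-based model, in [0, 1].

    Hypothetical thresholds: at or below near_m the RGBD data is assumed
    unusable, so only the geometry-based model is shown (weight 0); at or
    beyond far_m only the image-based model is shown (weight 1).
    """
    if far_m <= near_m:
        raise ValueError("far_m must be greater than near_m")
    # Normalize the distance into [0, 1] and clamp.
    t = (distance_m - near_m) / (far_m - near_m)
    t = min(1.0, max(0.0, t))
    # Smoothstep avoids a visible jump at either end of the transition.
    return t * t * (3.0 - 2.0 * t)


def fuse_pixel(image_rgb, geometry_rgb, distance_m):
    """Linearly blend one pixel from the two renderings."""
    w = fusion_weight(distance_m)
    return tuple(w * i + (1.0 - w) * g for i, g in zip(image_rgb, geometry_rgb))
```

Under these assumptions, a hand far from the screen is drawn entirely from the image-based model, a hand touching the screen is drawn entirely from the skeleton-driven geometry model, and intermediate distances mix the two smoothly.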