For robot manipulation, a complete and accurate object shape is desirable. Here, we present a method that combines visual and haptic reconstruction in a closed-loop pipeline. From an initial viewpoint, the object shape is reconstructed using an implicit surface deep neural network. The location with highest uncertainty is selected for haptic exploration, the object is touched, the new information from touch and a new point cloud from the camera are added, object position is re-estimated and the cycle is repeated. We extend Rustler et al. (2022) by using a new theoretically grounded method to determine the points with highest uncertainty, and we increase the yield of every haptic exploration by adding not only the contact points to the point cloud but also incorporating the empty space established through the robot movement to the object. Additionally, the solution is compact in that the jaws of a closed two-finger gripper are directly used for exploration. The object position is re-estimated after every robot action and multiple objects can be present simultaneously on the table. We achieve a steady improvement with every touch using three different metrics and demonstrate the utility of the better shape reconstruction in grasping experiments on the real robot. On average, grasp success rate increases from 63.3% to 70.4% after a single exploratory touch and to 82.7% after five touches. The collected data and code are publicly available (https://osf.io/j6rkd/, https://github.com/ctu-vras/vishac)
翻译:对于机器人操作来说, 一个完整和准确的物体形状是可取的。 在这里, 我们提出一种方法, 将视觉和偶然重建结合到封闭环状管道的管道中。 从最初的角度看, 对象形状是使用隐含的表面深神经网络来重建的。 选择最不确定的位置是为了进行偶然的探索, 对象被触动, 接触中的新信息以及相机中的新点云被添加, 物体位置被重新估计, 循环循环被重复。 我们使用新的理论基础方法来确定具有最高不确定性的点, 并且我们不仅在点云上添加接触点, 而且还将机器人运动到目标上建立的空空间纳入其中, 从而增加了每次偶然探索的收益。 此外, 解决方案是紧凑, 闭着的两指柄抓柄的下巴直接用于勘探, 对象位置被重新估计, 每个机器人动作和多个物体都可以同时出现在桌面上。 我们使用三种不同的标准来稳步改进所有接触, 并展示在真正机器人的实验中进行更好的形状重建的效用。 平均, 将20 / 和85 的 成功率 提高到了 。 。 。</s>