Self-occlusion is challenging for cloth manipulation, as it makes it difficult to estimate the full state of the cloth. Ideally, a robot trying to unfold a crumpled or folded cloth should be able to reason about the cloth's occluded regions. We leverage recent advances in pose estimation for cloth to build a system that uses explicit occlusion reasoning to unfold a crumpled cloth. Specifically, we first learn a model to reconstruct the mesh of the cloth. However, the model will likely have errors due to the complexities of the cloth configurations and due to ambiguities from occlusions. Our main insight is that we can further refine the predicted reconstruction by performing test-time finetuning with self-supervised losses. The obtained reconstructed mesh allows us to use a mesh-based dynamics model for planning while reasoning about occlusions. We evaluate our system both on cloth flattening as well as on cloth canonicalization, in which the objective is to manipulate the cloth into a canonical pose. Our experiments show that our method significantly outperforms prior methods that do not explicitly account for occlusions or perform test-time optimization. Videos and visualizations can be found on our $\href{https://sites.google.com/view/occlusion-reason/home}{\text{project website}}.$
翻译:自我封闭对于布料操纵来说具有挑战性,因为它使得很难估计布料的完整状态。 理想的是, 试图展出折叠或折叠布的机器人应该能够解释布布的隐蔽区域。 我们利用最近布面估算的进展来建立一个系统, 使用明确的隐蔽推理来展示一块被压碎的布料。 具体地说, 我们首先学习一个重建布料网的模型。 但是, 由于布料配置的复杂性和隐蔽的模糊性,模型可能会有错误。 我们的主要洞察力是, 我们可以用自我监督损失来进行测试- 时间微调来进一步改进预测的重建。 获得的重塑网格允许我们使用基于网格的动态模型来进行布料规划, 同时推理出布料的封闭性推理。 我们既在布板板上,又在布料库层化上评估我们的系统, 目的是将布料操纵成一个罐体。 我们的实验显示, 我们的方法将大大地超越了先前的方法, 而没有明确计算到美化网站 或测试- 测试- 优化网站 。