Most object manipulation strategies for robots are based on the assumption that the object is rigid (i.e., with fixed geometry) and the goal's details have been fully specified (e.g., the exact target pose). However, there are many tasks that involve spatial relations in human environments where these conditions may be hard to satisfy, e.g., bending and placing a cable inside an unknown container. To develop advanced robotic manipulation capabilities in unstructured environments that avoid these assumptions, we propose a novel long-horizon framework that exploits contrastive planning in finding promising collaborative actions. Using simulation data collected by random actions, we learn an embedding model in a contrastive manner that encodes the spatio-temporal information from successful experiences, which facilitates the subgoal planning through clustering in the latent space. Based on the keypoint correspondence-based action parameterization, we design a leader-follower control scheme for the collaboration between dual arms. All models of our policy are automatically trained in simulation and can be directly transferred to real-world environments. To validate the proposed framework, we conduct a detailed experimental study on a complex scenario subject to environmental and reachability constraints in both simulation and real environments.
翻译:机器人的大多数天体操纵策略所依据的假设是,天体是僵硬的(即固定几何),目标的细节已经完全具体(例如,确切的目标方形),然而,许多任务涉及人类环境中的空间关系,这些条件可能难以满足,例如弯曲和在未知容器内放置电缆。为了在不结构的环境中发展先进的机器人操纵能力,避免这些假设,我们提议了一个新的长视距框架,利用对比性规划寻找有希望的合作行动。我们利用随机行动收集的模拟数据,从成功经验中以对比性的方式学习了嵌入模型,通过在潜在空间集聚来便利次级目标规划。根据关键点对应行动参数的参数,我们设计了双臂之间协作的领先跟踪者控制计划。我们政策的所有模型都是在模拟中自动培训的,可以直接转移到现实世界环境中。为了验证拟议框架,我们进行了关于复杂情景的详细实验研究,在模拟和可实现性两方面都受到环境和可实现性制约。