In this paper, we propose an affordance model, which is built on Conditional Neural Processes, that can predict effect trajectories given objects, action or effect information at any time. Affordances are represented in a latent representation that combines object, action and effect channels. This model allows us to make predictions of intermediate effects expected to be obtained from partial action executions, and this capability is used to make multi-step plans that include partial actions in order to achieve goals. We first show that our model can make accurate continuous effect predictions. We compared our model with a recent LSTM-based effect predictor using an existing dataset that includes lever-up actions. Next, we showed that our model can generate accurate effect predictions for push and grasp actions. Finally, we showed that our system can generate successful multi-step plans in order to bring objects to desired positions. Importantly, the proposed system generated more accurate and effective plans with partial action executions compared to plans that only consider full action executions. Although continuous effect prediction and multi-step planning based on learning affordances have been studied in the literature, continuous affordance and effect predictions have not been utilized in making accurate and fine-grained plans.
翻译:在本文中,我们提出了一个价格模型,该模型以有条件神经过程为基础,可以随时预测给定对象、行动或效果信息的效果轨迹; 价格模型代表着一种将物体、行动和效果渠道结合起来的潜在代表形式; 这个模型使我们能够预测部分行动处决预期会产生的中间效果, 并且利用这一能力来制定多步骤计划, 包括部分行动以实现目标; 我们首先显示我们的模型可以作出准确的持续效果预测; 我们用包括杠杆行动在内的现有数据集将我们的模型与最近的LSTM效应预测器进行了比较; 接下来, 我们显示我们的模型可以产生精确的效果预测, 推动和抓住行动。 最后, 我们表明我们的系统可以产生成功的多步骤计划, 使目标达到预期的位置。 重要的是, 与只考虑完全行动处决的计划相比,拟议的系统产生了更准确和有效的计划, 与只考虑完全行动处决的计划相比, 。 虽然在文献中研究了持续效果预测和基于学习价格预测的多步骤规划, 并且没有在精确的计划中加以利用。</s>