Learning the physical dynamics of deformable objects with particle-based representation has been the objective of many computational models in machine learning. While several state-of-the-art models have achieved this objective in simulated environments, most existing models impose a precondition, such that the input is a sequence of ordered point sets - i.e., the order of the points in each point set must be the same across the entire input sequence. This restrains the model to generalize to real-world data, which is considered to be a sequence of unordered point sets. In this paper, we propose a model named time-wise PointNet (TP-Net) that solves this problem by directly consuming a sequence of unordered point sets to infer the future state of a deformable object with particle-based representation. Our model consists of a shared feature extractor that extracts global features from each input point set in parallel and a prediction network that aggregates and reasons on these features for future prediction. The key concept of our approach is that we use global features rather than local features to achieve invariance to input permutations and ensure the stability and scalability of our model. Experiments demonstrate that our model achieves state-of-the-art performance in both synthetic dataset and in real-world dataset, with real-time prediction speed. We provide quantitative and qualitative analysis on why our approach is more effective and efficient than existing approaches.
翻译:在机器学习中,许多计算模型的目标一直是学习粒子代表制变形物体的物理动态。虽然若干最先进的模型已经在模拟环境中实现了这一目标,但大多数现有模型都设定了一个先决条件,即输入是定序点组合的序列,即每个点集的点顺序在整个输入序列中必须是相同的。这限制了将数据概括到现实世界数据的模型,而这种数据被认为是未定点组合的序列。在本文中,我们提出了一个名为时针点网(TP-Net)的模型,通过直接消耗一系列无序点组合来解决这个问题,从而推断出一个以粒子代表制表示的变形对象的未来状态。我们的模型包括一个共同的特征提取器,从每个输入点组合中提取全球特征,以及一个预测网络,汇总这些特征和原因以进行未来预测。我们方法的关键概念是,我们使用全球特征而不是地方特征来实现输入变异,并确保我们模型中不测值的不测点数和量化方法的稳定性和可缩略性,我们在模型中提供我们实际数据分析的准确性和准确性分析。