We present in this paper a Garment Similarity Network (GarNet) that learns geometric and physical similarities between known garments by continuously observing a garment while a robot picks it up from a table. The aim is to capture and encode geometric and physical characteristics of a garment into a manifold where a decision can be carried out, such as predicting the garment's shape class and its visually perceived weight. Our approach features an early stop strategy, which means that GarNet does not need to observe the entire video sequence to make a prediction and maintain high prediction accuracy values. In our experiments, we find that GarNet achieves prediction accuracies of 98% for shape classification and 95% for predicting weights. We compare our approach with state-of-art methods, and we observe that our approach advances the state-of-art methods from 70.8% to 98% for shape classification.
翻译:在本文中,我们提出了一个服装相似性网络(GarNet),通过不断观察服装,让机器人从一张桌子上摘取衣服,来学习已知服装之间的几何和物理相似性。目的是捕捉服装的几何和物理特征并将其编码成一个可以作出决定的方块,例如预测服装的形状等级及其视觉觉察到的重量。我们的方法有一个早期停止策略,这意味着GarNet不需要观察整个视频序列来作出预测并保持高预测准确值。在我们的实验中,我们发现GarNet在形状分类方面实现了98%的预测,在重量预测方面实现了95%的预测。我们将我们的方法与最先进的方法进行比较,我们发现我们的方法将最先进的方法从70.8%提高到98%用于形状分类。