While learning from synthetic training data has recently gained an increased attention, in real-world robotic applications, there are still performance deficiencies due to the so-called Sim-to-Real gap. In practice, this gap is hard to resolve with only synthetic data. Therefore, we focus on an efficient acquisition of real data within a Sim-to-Real learning pipeline. Concretely, we employ deep Bayesian active learning to minimize manual annotation efforts and devise an autonomous learning paradigm to select the data that is considered useful for the human expert to annotate. To achieve this, a Bayesian Neural Network (BNN) object detector providing reliable uncertainty estimates is adapted to infer the informativeness of the unlabeled data. Furthermore, to cope with mis-alignments of the label distribution in uncertainty-based sampling, we develop an effective randomized sampling strategy that performs favorably compared to other complex alternatives. In our experiments on object classification and detection, we show benefits of our approach and provide evidence that labeling efforts can be reduced significantly. Finally, we demonstrate the practical effectiveness of this idea in a grasping task on an assistive robot.
翻译:最近,在实际世界机器人应用中,从合成培训数据中学习最近引起越来越多的注意,但由于所谓的“Sim-Real”差距,仍然存在着性能缺陷。实际上,这一差距很难用合成数据来弥补。因此,我们注重在“Sim-Real”学习管道内有效获取真实数据。具体地说,我们利用深巴伊西亚积极学习来尽量减少人工批注努力,并设计一种自主学习模式来选择被认为对人类专家有帮助的数据。为此,一个提供可靠不确定性估计数的巴伊西亚神经网络物体探测器(BNN)进行了调整,以推断未贴标签数据的信息性。此外,为了应对基于不确定性的抽样中标签分布的不匹配问题,我们制定了有效的随机抽样战略,与其他复杂的替代方法相比,效果优异。在我们关于物体分类和检测的实验中,我们展示了我们的方法的好处,并提供了证据,说明标签工作可以大大减少。最后,我们展示了这一想法在掌握辅助机器人的任务中的实际效力。