Wireless tags are increasingly used to track and identify common items of interest such as retail goods, food, medicine, clothing, books, documents, keys, equipment, and more. At the same time, there is a need for labelled visual data featuring such items for the purpose of training object detection and recognition models for robots operating in homes, warehouses, stores, libraries, pharmacies, and so on. In this paper, we ask: can we leverage the tracking and identification capabilities of such tags as a basis for a large-scale automatic image annotation system for robotic perception tasks? We present RF-Annotate, a pipeline for autonomous pixel-wise image annotation which enables robots to collect labelled visual data of objects of interest as they encounter them within their environment. Our pipeline uses unmodified commodity RFID readers and RGB-D cameras, and exploits arbitrary small-scale motions afforded by mobile robotic platforms to spatially map RFIDs to corresponding objects in the scene. Our only assumption is that the objects of interest within the environment are pre-tagged with inexpensive battery-free RFIDs costing 3-15 cents each. We demonstrate the efficacy of our pipeline on several RGB-D sequences of tabletop scenes featuring common objects in a variety of indoor environments.
翻译:在本文中,我们要求:我们能否利用这些标签的追踪和识别能力,作为大规模自动图像说明系统的基础,用于机器人感知任务?我们提供RF-Annotate,这是一条自动像素图像说明的管道,使机器人能够收集这些物品在其环境中遇到的物品的标签视觉数据。我们的管道使用未经改装的商品RFID阅读器和RGB-D摄像机,利用移动式机器人平台提供的任意小型移动式移动式移动式移动式移动式移动式移动式机器人平台,将RFIDs在空间上映到现场的相应物体。我们的唯一假设是,环境中的受关注对象事先配有价格低廉的无电池RFIDs,费用为每台3-15美分。我们展示了我们室内各层的RGB-D相形天体。