Recently, there has been tremendous interest in industry 4.0 infrastructure to address labor shortages in global supply chains. Deploying artificial intelligence-enabled robotic bin picking systems in real world has become particularly important for reducing labor demands and costs while increasing efficiency. To this end, artificial intelligence-enabled robotic bin picking systems may be used to automate bin picking, but may also cause expensive damage during an abnormal event such as a sensor failure. As such, reliability becomes a critical factor for translating artificial intelligence research to real world applications and products. In this paper, we propose a reliable vision system with MultiModal Redundancy (MMRNet) for tackling object detection and segmentation for robotic bin picking using data from different modalities. This is the first system that introduces the concept of multimodal redundancy to combat sensor failure issues during deployment. In particular, we realize the multimodal redundancy framework with a gate fusion module and dynamic ensemble learning. Finally, we present a new label-free multimodal consistency score that utilizes the output from all modalities to measure the overall system output reliability and uncertainty. Through experiments, we demonstrate that in an event of missing modality, our system provides a much more reliable performance compared to baseline models. We also demonstrate that our MC score is a more powerful reliability indicator for outputs during inference time where model generated confidence score are often over-confident.
翻译:最近,人们对工业4.0基础设施产生了巨大的兴趣,以解决全球供应链劳动力短缺问题。在现实世界中部署人工智能机器人垃圾回收系统对于降低劳动力需求和成本,同时提高效率变得特别重要。为此,人工智能机器人垃圾采集系统可能被用于自动挑选垃圾,但也可能在传感器故障等异常事件期间造成昂贵的损坏。因此,可靠性成为将人工智能研究转化为真实世界应用和产品的关键因素。在本文件中,我们提议与多模式再开发公司(MMMRNet)一起建立一个可靠的愿景系统,用以处理利用不同模式的数据对机器人垃圾进行天体探测和分割的问题。这是第一个在部署期间引入多式冗余概念来消除传感器故障问题的系统。特别是,我们用门型集成模块和动态组合学习等异常事件实现多模式冗余框架。最后,我们提出了一个新的无标签的多式联运一致性评分,利用所有模式的产出来衡量整个系统产出的可靠性和不确定性。我们通过实验表明,在缺少模式的情况下,我们的系统提供了比基线模型更可靠的性性业绩比基准模型要高得多。我们常常证明,我们的系统比基准模型的评分值要高得多。