We focus on enabling damage and tampering detection in logistics and tackle the problem of 3D shape reconstruction of potentially damaged parcels. As input we utilize single RGB images, which corresponds to use-cases where only simple handheld devices are available, e.g. for postmen during delivery or clients on delivery. We present a novel synthetic dataset, named Parcel3D, that is based on the Google Scanned Objects (GSO) dataset and consists of more than 13,000 images of parcels with full 3D annotations. The dataset contains intact, i.e. cuboid-shaped, parcels and damaged parcels, which were generated in simulations. We work towards detecting mishandling of parcels by presenting a novel architecture called CubeRefine R-CNN, which combines estimating a 3D bounding box with an iterative mesh refinement. We benchmark our approach on Parcel3D and an existing dataset of cuboid-shaped parcels in real-world scenarios. Our results show, that while training on Parcel3D enables transfer to the real world, enabling reliable deployment in real-world scenarios is still challenging. CubeRefine R-CNN yields competitive performance in terms of Mesh AP and is the only model that directly enables deformation assessment by 3D mesh comparison and tampering detection by comparing viewpoint invariant parcel side surface representations. Dataset and code are available at https://a-nau.github.io/parcel3d.
翻译:我们专注于可能损坏包裹的损坏和篡改检测,并解决三维形状重建的问题。我们采用单个RGB图像作为输入,这对应于只有简单手持设备可用的用例,例如邮递员在递送或客户在递送期间。我们提供了一个名为Parcel3D的新型合成数据集,该数据集基于Google扫描物体(GSO)数据集,包含逾13,000个被完整三维标注的包裹。数据集包含未受损的包裹,即长方体形状的包裹和在模拟中产生的损坏包裹。我们致力于通过提出一种新颖的体系结构来检测包裹的误处理,该体系结构称为CubeRefine R-CNN,它将估计3D边界框与迭代网格细化相结合。我们在Parcel3D和现有的长方体形状的包裹数据集上进行了基准测试。我们的结果表明,虽然在Parcel3D上训练可以实现转移至现实世界,但在现实世界的部署仍然具有挑战性。CubeRefine R-CNN在Mesh AP方面具有竞争力,并且是唯一一个直接通过比较视点不变的包裹侧表面表示来实现变形评估和篡改检测的模型。该数据集和代码可在https://a-nau.github.io/parcel3d 上获得。