We present a novel framework for finding a set of tight bounding boxes of a 3D shape via neural-network-based over-segmentation and iterative merging and refinement. Achieving tight bounding boxes of a shape while guaranteeing the complete boundness is an essential task for efficient geometric operations and unsupervised semantic part detection, but previous methods fail to achieve both full coverage and tightness. Neural-network-based methods are not suitable for these goals due to the non-differentiability of the objective, and also classic iterative search methods suffer from their sensitivity to the initialization. We demonstrate that the best integration of the learning-based and iterative search methods can achieve the bounding boxes with both properties. We employ an existing unsupervised segmentation network to split the shape and obtain over-segmentation. Then, we apply hierarchical merging with our novel tightness-aware merging and stopping criteria. To overcome the sensitivity to the initialization, we also refine the bounding box parameters in a game setup with a soft reward function promoting a wider exploration. Lastly, we further improve the bounding boxes with a MCTS-based multi-action space exploration. Our experimental results demonstrate the full coverage, tightness, and the adequate number of bounding boxes of our method.
翻译:我们提出了一种新颖的框架,通过基于神经网络的学习分割和迭代合并和精化来找到 3D 形状的一组紧 bounding boxes。实现形状的紧 bounding boxes,同时保证完整的 boundness,是高效几何运算和无监督语义部件检测的重要任务,但以前的方法未能同时实现完全覆盖和紧密性。基于神经网络的方法由于目标的不可微分性而不适用于这些目标,而传统的迭代搜索方法也因其对初始化的敏感性而受苦。我们展示了学习和迭代搜索方法最佳集成可以实现这两种属性的 bounding boxes。我们采用现有的无监督分割网络来分割形状并获得过分割。然后,我们使用我们的新颖 tightness-aware merging 和停止条件进行分层合并。为了克服对初始化的敏感性,我们还通过软奖励函数在游戏设置中调整 bounding box 参数,促进更广泛的 exploration。最后,我们基于 MCTS 的多操作空间探索进一步改进 bounding boxes。我们的实验结果证明了方法的完全覆盖、紧密性和适当数目的 bounding boxes。