This paper presents a novel attention-based neural network for structured reconstruction, which takes a 2D raster image as an input and reconstructs a planar graph depicting an underlying geometric structure. The approach detects corners and classifies edge candidates between corners in an end-to-end manner. Our contribution is a holistic edge classification architecture, which 1) initializes the feature of an edge candidate by a trigonometric positional encoding of its end-points; 2) fuses image feature to each edge candidate by deformable attention; 3) employs two weight-sharing Transformer decoders to learn holistic structural patterns over the graph edge candidates; and 4) is trained with a masked learning strategy. The corner detector is a variant of the edge classification architecture, adapted to operate on pixels as corner candidates. We conduct experiments on two structured reconstruction tasks: outdoor building architecture and indoor floorplan planar graph reconstruction. Extensive qualitative and quantitative evaluations demonstrate the superiority of our approach over the state of the art. Code and pre-trained models are available at https://heat-structured-reconstruction.github.io.
翻译:本文展示了一个新的以关注为基础的结构性重建神经网络,该神经网络以2D光栅图像为投入,并重建了描述基本几何结构的平面图。该方法以端到端的方式探测角,对角之间的边缘候选人进行分类。我们的贡献是一个整体边缘分类结构,1)通过对其终点进行三角测量定位编码,初步确定边缘候选人的特征;2)通过变形关注将图像特征结合给每个边缘候选人;3)使用两个重量共享的变异器在图形边缘候选人中学习整体结构模式;4)用蒙面学习战略进行培训。角探测器是边缘分类结构的一种变体,在像素上作为角候选人运作。我们进行两项结构化重建任务的实验:户外建筑结构和室内地平面规划图重建。广泛的定性和定量评价表明我们的方法优于艺术状态。代码和预先培训的模式可在https://heat-structurd-reformation.github.io查阅。