Access to large and diverse computer-aided design (CAD) drawings is critical for developing symbol spotting algorithms. In this paper, we present FloorPlanCAD, a large-scale real-world CAD drawing dataset containing over 10,000 floor plans, ranging from residential to commercial buildings. All CAD drawings in the dataset are represented as vector graphics, which enables us to provide line-grained annotations of 30 object categories. Equipped with such annotations, we introduce the task of panoptic symbol spotting, which requires spotting not only instances of countable things but also the semantics of uncountable stuff. To solve this task, we propose a novel method that combines Graph Convolutional Networks (GCNs) with Convolutional Neural Networks (CNNs); it captures both non-Euclidean and Euclidean features and can be trained end-to-end. The proposed CNN-GCN method achieves state-of-the-art (SOTA) performance on the task of semantic symbol spotting and helps us build a baseline network for the panoptic symbol spotting task. Our contributions are three-fold: 1) to the best of our knowledge, the presented CAD drawing dataset is the first of its kind; 2) the panoptic symbol spotting task treats the spotting of both thing instances and stuff semantics as one recognition problem; and 3) we present a baseline solution to the panoptic symbol spotting task based on a novel CNN-GCN method, which achieves SOTA performance on semantic symbol spotting. We believe that these contributions will boost research in related areas.
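To make the panoptic symbol spotting output concrete, the following is a minimal sketch of how line-grained labels might be grouped into thing instances and stuff regions. It is an illustration only, not the dataset's actual schema: the class names (`door`, `window`, `wall`), the `PrimitiveLabel` type, and the `group_panoptic` helper are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Hypothetical category split for illustration; the dataset itself
# annotates 30 object categories, which are not reproduced here.
THING_CLASSES = {"door", "window"}  # countable: each occurrence is an instance
STUFF_CLASSES = {"wall"}            # uncountable: only the semantic label matters


@dataclass(frozen=True)
class PrimitiveLabel:
    """Label attached to one vector primitive (e.g. a line) in a drawing."""
    primitive_id: int
    semantic: str
    instance_id: Optional[int] = None  # left as None for stuff primitives


def group_panoptic(
    labels: List[PrimitiveLabel],
) -> Tuple[Dict[Tuple[str, int], List[int]], Dict[str, List[int]]]:
    """Group primitive ids into thing instances and stuff regions."""
    things: Dict[Tuple[str, int], List[int]] = defaultdict(list)
    stuff: Dict[str, List[int]] = defaultdict(list)
    for label in labels:
        if label.semantic in THING_CLASSES:
            if label.instance_id is None:
                raise ValueError(f"thing primitive {label.primitive_id} needs an instance id")
            # Things are keyed by (class, instance id) so instances stay separate.
            things[(label.semantic, label.instance_id)].append(label.primitive_id)
        elif label.semantic in STUFF_CLASSES:
            # Stuff is keyed by class only; instance ids are ignored.
            stuff[label.semantic].append(label.primitive_id)
        else:
            raise ValueError(f"unknown class: {label.semantic}")
    return dict(things), dict(stuff)


labels = [
    PrimitiveLabel(0, "door", 1),
    PrimitiveLabel(1, "door", 1),
    PrimitiveLabel(2, "door", 2),
    PrimitiveLabel(3, "wall"),
    PrimitiveLabel(4, "wall"),
]
things, stuff = group_panoptic(labels)
```

In this toy drawing, the two doors remain separate instances, while both wall lines fall into a single semantic region. This is the distinction the panoptic task draws between things and stuff.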