Recently, deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection with the rich and abstract edge representation capacities. However, the high performance of CNN based edge detection is achieved with a large pretrained CNN backbone, which is memory and energy consuming. In addition, it is surprising that the previous wisdom from the traditional edge detectors, such as Canny, Sobel, and LBP are rarely investigated in the rapid-developing deep learning era. To address these issues, we propose a simple, lightweight yet effective architecture named Pixel Difference Network (PiDiNet) for efficient edge detection. Extensive experiments on BSDS500, NYUD, and Multicue are provided to demonstrate its effectiveness, and its high training and inference efficiency. Surprisingly, when training from scratch with only the BSDS500 and VOC datasets, PiDiNet can surpass the recorded result of human perception (0.807 vs. 0.803 in ODS F-measure) on the BSDS500 dataset with 100 FPS and less than 1M parameters. A faster version of PiDiNet with less than 0.1M parameters can still achieve comparable performance among state of the arts with 200 FPS. Results on the NYUD and Multicue datasets show similar observations. The codes are available at https://github.com/zhuoinoulu/pidinet.
翻译:最近,深层革命神经网络(CNNs)能够以丰富和抽象的边缘代表能力在边缘检测中取得人类水平的高级表现,然而,有线电视新闻网基于边缘检测的高性能是通过受过训练的大型CNN骨干才能达到的,这是记忆和能源消耗。此外,令人惊讶的是,传统的边缘探测器,如坎尼、索贝尔和LBP(Canny、Sobel和LBP)先前的智慧很少在迅速发展的深层次学习时代得到调查。为了解决这些问题,我们提议建立一个简单、轻而有效的结构,名为像素差异网络(PiDiDiNet),以高效的边缘检测。在BSDS500、NYUD和Mulucue上进行广泛的实验,以展示其有效性、高培训和推断效率。令人惊讶的是,在仅利用BSDS500和VOC数据集从零下进行培训时,PidiNet能够超过人类感知的结果记录(0.807和ODS/FSY的0.803),在100个FDS/com的参数中仍然低于1M参数。PDISNUDO的快速版本,在FDSUDS的FSDFSDSDSDS的类似结果中可以比较的演示中可以实现。