Pavement Distress Recognition (PDR) is an important step in pavement inspection and can be powered by image-based automation to expedite the process and reduce labor costs. Pavement images are often in high-resolution with a low ratio of distressed to non-distressed areas. Advanced approaches leverage these properties via dividing images into patches and explore discriminative features in the scale space. However, these approaches usually suffer from information loss during image resizing and low efficiency due to complex learning frameworks. In this paper, we propose a novel and efficient method for PDR. A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing, and can be flexibly plugged into the image classification network as a pre-network to exploit resolution and scale information. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information across different feature granularities and scales. The mined information is passed along to the resized images to yield an informative image pyramid to assist the image classification network for PDR. We applied our method to three well-known Convolutional Neural Networks (CNNs), and conducted an evaluation on a large-scale pavement image dataset named CQU-BPDD. Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of these CNN models and show that the simple combination of KIPRN and EfficientNet-B3 significantly outperforms the state-of-the-art patch-based method in both performance and efficiency.
翻译:PDR 承认(PDR) 是路面检查的一个重要步骤, 可以通过图像自动化获得动力, 以加快进程和降低劳动力成本。 平面图像通常具有高分辨率, 受困与非受压区域的比例较低。 高级方法通过将图像分割成补丁和探索规模空间中的歧视性特征来利用这些属性。 但是, 这些方法通常会因图像重新定位和因复杂的学习框架而降低效率而导致信息丢失。 在本文中, 我们提议为PRODR提供一个新颖的高效方法。 一个名为 Kernel Inversed Pyramid 重新定位网络( KIPRN) 的光网络被引入了图像重塑, 并且可以灵活地插入图像分类网络, 作为一种利用分辨率和规模信息的预网络。 在 KIPRN3 系统中, 金字形和内内骨内骨内骨内骨内骨外的偏差信息被专门用来清除不同特征和规模的歧视性信息。 这些采矿信息被传递到一个信息更新的图像图像分类系统。 我们将我们的方法灵活地插入了一个清晰的平面的平面平面图像评估系统系统, 。