Passive millimeter-wave (PMMW) is a significant potential technique for human security screening. Several popular object detection networks have been used for PMMW images. However, restricted by the low resolution and high noise of PMMW images, PMMW hidden object detection based on deep learning usually suffers from low accuracy and low classification confidence. To tackle the above problems, this paper proposes a Task-Aligned Detection Transformer network, named PMMW-DETR. In the first stage, a Denoising Coarse-to-Fine Transformer (DCFT) backbone is designed to extract long- and short-range features in the different scales. In the second stage, we propose the Query Selection module to introduce learned spatial features into the network as prior knowledge, which enhances the semantic perception capability of the network. In the third stage, aiming to improve the classification performance, we perform a Task-Aligned Dual-Head block to decouple the classification and regression tasks. Based on our self-developed PMMW security screening dataset, experimental results including comparison with State-Of-The-Art (SOTA) methods and ablation study demonstrate that the PMMW-DETR obtains higher accuracy and classification confidence than previous works, and exhibits robustness to the PMMW images of low quality.
翻译:使用几个受欢迎的物体探测网络来提取不同规模的长程和短程特征。在第二阶段,我们提议通过查询选择模块在网络中引入学到的空间特征,作为先前的知识,以提高网络的语义感知能力。在第三阶段,为了提高分类性能,我们执行一个任务式的双头建筑块,以区分分类和回归任务。根据我们自行开发的PMMW安全筛选数据集,实验性结果,包括与国家-国家-艺术(SOTA)方法的比较,以及表明MPMW-DETR以往的低质量和图像的可靠程度。