Generative adversarial networks (GANs) have been promising for many computer vision problems due to their powerful capabilities to enhance the data for training and test. In this paper, we leveraged GANs and proposed a new architecture with a cascaded Single Shot Detector (SSD) for pedestrian detection at distance, which is yet a challenge due to the varied sizes of pedestrians in videos at distance. To overcome the low-resolution issues in pedestrian detection at distance, DCGAN is employed to improve the resolution first to reconstruct more discriminative features for a SSD to detect objects in images or videos. A crucial advantage of our method is that it learns a multi-scale metric to distinguish multiple objects at different distances under one image, while DCGAN serves as an encoder-decoder platform to generate parts of an image that contain better discriminative information. To measure the effectiveness of our proposed method, experiments were carried out on the Canadian Institute for Advanced Research (CIFAR) dataset, and it was demonstrated that the proposed new architecture achieved a much better detection rate, particularly on vehicles and pedestrians at distance, making it highly suitable for smart cities applications that need to discover key objects or pedestrians at distance.
翻译:引发对抗网络(GANs)对于许多计算机视觉问题很有希望,因为它们具有加强培训和测试数据的强大能力。在本文中,我们利用GANs并提议了一个新的建筑,配有级联单一射击探测器(SSD),用于远距离行人探测,由于远距离视频行人规模不同,这仍然是一个挑战。为了克服远距离行人探测中的低分辨率问题,DCGAN正在改进解决方案,首先为SSD重建更具有歧视性的特征,以探测图像或视频中的天体。我们方法的一个关键优势是,它学习了一种多尺度的测量尺度,以辨别不同距离的图像下的多个天体,而DCGAN则作为一个编码器解码平台,以生成含有更好歧视信息的部分图像。为了衡量我们拟议方法的有效性,在加拿大高级研究所(CIFAR)数据集进行了实验,并证明拟议的新结构取得了更好的检测率,特别是在远处的车辆和行人,因此非常适合智能城市在远程发现关键物体或行走物体的应用。