Most city establishments of developing cities are digitally unlabeled because of the lack of automatic annotation systems. Hence location and trajectory services such as Google Maps, Uber etc remain underutilized in such cities. Accurate signboard detection in natural scene images is the foremost task for error-free information retrieval from such city streets. Yet, developing accurate signboard localization system is still an unresolved challenge because of its diverse appearances that include textual images and perplexing backgrounds. We present a novel object detection approach that can detect signboards automatically and is suitable for such cities. We use Faster R-CNN based localization by incorporating two specialized pretraining methods and a run time efficient hyperparameter value selection algorithm. We have taken an incremental approach in reaching our final proposed method through detailed evaluation and comparison with baselines using our constructed SVSO (Street View Signboard Objects) signboard dataset containing signboard natural scene images of six developing countries. We demonstrate state-of-the-art performance of our proposed method on both SVSO dataset and Open Image Dataset. Our proposed method can detect signboards accurately (even if the images contain multiple signboards with diverse shapes and colours in a noisy background) achieving 0.90 mAP (mean average precision) score on SVSO independent test set. Our implementation is available at: https://github.com/sadrultoaha/Signboard-Detection
翻译:由于缺少自动说明系统,大多数发展中城市的市立机构没有数字标签,大多数城市都缺乏自动说明系统。因此,在这些城市中,定位和轨道服务,如谷歌地图、Uber等,仍然没有得到充分利用。自然景象中准确的信号板探测是从这些城市街道检索无误信息的首要任务。然而,开发准确的信号板本地化系统仍是一个尚未解决的挑战,因为其外观多种多样,包括文本图像和令人不解的背景。我们展示了一种新颖的物体探测方法,可以自动检测信号板,并且适合这些城市。我们使用更快的基于 R-CNN 的本地化,方法是采用两种专门的预培训方法,并运行一个有时间效率的超参数值选择算法。我们采取了渐进式方法,通过使用我们建造的 SVSO(Streetroitiveview Sportboard Offites) 的标牌与基准进行详细评估和比较,从而实现我们提出的最终方法。我们在SVSVSO/公开图像数据库的拟议方法可以准确检测(即使我们的图像含有多种标准)在SBSB/SralSBA/SBSBSBSBA 上实现不同的标准。