Supervised training of object detectors requires well-annotated large-scale datasets, whose production is costly. Therefore, some efforts have been made to obtain annotations in economical ways, such as cloud sourcing. However, datasets obtained by these methods tend to contain noisy annotations such as inaccurate bounding boxes and incorrect class labels. In this study, we propose a new problem setting of training object detectors on datasets with entangled noises of annotations of class labels and bounding boxes. Our proposed method efficiently decouples the entangled noises, corrects the noisy annotations, and subsequently trains the detector using the corrected annotations. We verified the effectiveness of our proposed method and compared it with the baseline on noisy datasets with different noise levels. The experimental results show that our proposed method significantly outperforms the baseline.
翻译:受监督的物体探测器培训需要有详细说明的大型数据集,其制作成本很高。因此,已作出一些努力,以节省成本的方式获得说明,如云源源;然而,这些方法获得的数据集往往含有噪音性说明,如不准确的捆绑框和不正确的类标签。在本研究报告中,我们提议对带有缠绕类标签和捆绑盒的批注噪音的数据集进行新的目标探测器培训设置问题。我们建议的方法有效地解开缠绕的噪音,纠正吵闹的批注,随后用校正的说明培训探测器。我们核实了我们拟议方法的有效性,并将它与噪音数据集基线的不同噪音水平进行了比较。实验结果显示,我们拟议的方法大大超出了基线。