Efficient and reliable methods for training of object detectors are in higher demand than ever, and more and more data relevant to the field is becoming available. However, large datasets like Open Images Dataset v4 (OID) are sparsely annotated, and some measure must be taken in order to ensure the training of a reliable detector. In order to take the incompleteness of these datasets into account, one possibility is to use pretrained models to detect the presence of the unverified objects. However, the performance of such a strategy depends largely on the power of the pretrained model. In this study, we propose part-aware sampling, a method that uses human intuition for the hierarchical relation between objects. In terse terms, our method works by making assumptions like "a bounding box for a car should contain a bounding box for a tire". We demonstrate the power of our method on OID and compare the performance against a method based on a pretrained model. Our method also won the first and second place on the public and private test sets of the Google AI Open Images Competition 2018.
翻译:然而,对像开放图像数据集v4(OID)这样的大型数据集的附加说明很少,必须采取某种措施以确保培训可靠的探测器。为了考虑到这些数据集的不完整性,一种可能性是使用预先培训的模型来探测未核实的物体的存在。然而,这种战略的实施在很大程度上取决于预先培训的模型的力量。在本研究中,我们提议采用部分认知取样方法,即对物体之间的等级关系使用人类直觉的方法。用梯词来说,我们的方法是假设“汽车的捆绑盒应该包含轮胎的捆绑盒”。我们展示我们关于OID的方法的力量,并对照以预先培训模型为基础的方法对性能进行比较。我们的方法在2018年谷歌开放图像竞赛的公开测试组中也赢得了第一和第二位。