Many vision tasks use secondary information at inference time -- a seed -- to assist a computer vision model in solving a problem. For example, an initial bounding box is needed to initialize visual object tracking. To date, all such work makes the assumption that the seed is a good one. However, in practice, from crowdsourcing to noisy automated seeds, this is often not the case. We hence propose the problem of seed rejection -- determining whether to reject a seed based on the expected performance degradation when it is provided in place of a gold-standard seed. We provide a formal definition to this problem, and focus on two meaningful subgoals: understanding causes of error and understanding the model's response to noisy seeds conditioned on the primary input. With these goals in mind, we propose a novel training method and evaluation metrics for the seed rejection problem. We then use seeded versions of the viewpoint estimation and fine-grained classification tasks to evaluate these contributions. In these experiments, we show our method can reduce the number of seeds that need to be reviewed for a target performance by over 23% compared to strong baselines.
翻译:许多愿景任务在推论时间使用次级信息 -- -- 种子 -- -- 来帮助计算机愿景模型解决问题。例如,需要初始约束框来启动视觉对象跟踪。到目前为止,所有此类工作都假设种子是一个好种子。然而,实际上,从众包到吵闹的自动种子,情况往往并非如此。因此,我们提出了种子拒绝的问题 -- -- 确定是否根据预期的性能退化来拒绝种子,以取代金质标准种子。我们提供了这一问题的正式定义,并侧重于两个有意义的次级目标:了解错误原因和理解模型对以原始投入为条件的噪音种子的反应。我们考虑到这些目标,我们提出了新的培训方法和种子拒绝问题的评价指标。然后我们用种子版的观点估计和精细的分类任务来评估这些贡献。在这些实验中,我们展示了我们的方法可以减少目标性能需要审查的种子数量,比强基线减少23%以上。