The localization quality of automatic object detectors is typically evaluated by the Intersection over Union (IoU) score. In this work, we show that humans have a different view on localization quality. To evaluate this, we conduct a survey with more than 70 participants. Results show that for localization errors with the exact same IoU score, humans might not consider that these errors are equal, and express a preference. Our work is the first to evaluate IoU with humans and makes it clear that relying on IoU scores alone to evaluate localization errors might not be sufficient.
翻译:自动天体探测器的本地化质量通常通过交叉比对联盟(IoU)评分来评估。 在这项工作中,我们证明人类对本地化质量有不同的看法。为了评估这一点,我们进行了70多名参与者的调查。结果显示,对于具有完全相同的IoU评分的本地化错误,人类可能不会认为这些错误是平等的,而是表示偏好。我们的工作是首先对人类的IoU评分,并表明仅仅依靠IoU评分来评估本地化错误可能是不够的。