The xView2 competition and xBD dataset spurred significant advancements in overhead building damage detection, but the competition's pixel level scoring can lead to reduced solution performance in areas with tight clusters of buildings or uninformative context. We seek to advance automatic building damage assessment for disaster relief by proposing an auxiliary challenge to the original xView2 competition. This new challenge involves a new dataset and metrics indicating solution performance when damage is more local and limited than in xBD. Our challenge measures a network's ability to identify individual buildings and their damage level without excessive reliance on the buildings' surroundings. Methods that succeed on this challenge will provide more fine-grained, precise damage information than original xView2 solutions. The best-performing xView2 networks' performances dropped noticeably in our new limited/local damage detection task. The common causes of failure observed are that (1) building objects and their classifications are not separated well, and (2) when they are, the classification is strongly biased by surrounding buildings and other damage context. Thus, we release our augmented version of the dataset with additional object-level scoring metrics https://gitlab.kitware.com/dennis.melamed/xfbd to test independence and separability of building objects, alongside the pixel-level performance metrics of the original competition. We also experiment with new baseline models which improve independence and separability of building damage predictions. Our results indicate that building damage detection is not a fully-solved problem, and we invite others to use and build on our dataset augmentations and metrics.
翻译:XView2 竞赛和 xBD 数据集在管理大楼损坏探测方面大大提升了管理费建筑损坏的探测水平,但竞争的像素级评分可以降低建筑群密集或无信息化的地区的解决方案性能。我们试图通过提出对最初的 xView2 竞争的附带挑战,推进救灾的自动建筑损坏评估。这项新的挑战涉及一个新的数据集和计量,表明当损坏比 xBD 中更为局部和有限时的解决方案性能。我们的挑战衡量一个网络在不过分依赖建筑物周围环境的情况下确定单个建筑物及其损坏程度的能力。在这项挑战上取得成功的方法将比原始的 xVi2 解决方案提供更精细、准确的损坏信息。我们新的有限/局部损坏探测任务中表现最佳的xView2网络性能明显下降。观察到的常见失败原因是:(1) 建筑物体及其分类没有很好地分开,(2) 周围建筑和其他损坏环境环境的分类严重偏差。因此,我们发行了扩大的数据集版本,增加了目标级评分指标,没有显示建筑/ Geblab 的稳定性测试/ developmentalmentalalizalalizalizalizalizal