Recent proposed DETR variants have made tremendous progress in various scenarios due to their streamlined processes and remarkable performance. However, the learned queries usually explore the global context to generate the final set prediction, resulting in redundant burdens and unfaithful results. More specifically, a query is commonly responsible for objects of different scales and positions, which is a challenge for the query itself, and will cause spatial resource competition among queries. To alleviate this issue, we propose Team DETR, which leverages query collaboration and position constraints to embrace objects of interest more precisely. We also dynamically cater to each query member's prediction preference, offering the query better scale and spatial priors. In addition, the proposed Team DETR is flexible enough to be adapted to other existing DETR variants without increasing parameters and calculations. Extensive experiments on the COCO dataset show that Team DETR achieves remarkable gains, especially for small and large objects. Code is available at \url{https://github.com/horrible-dong/TeamDETR}.
翻译:最近提议的DETR变式由于简化了过程和出色的表现,在各种设想中取得了巨大进展;然而,所了解的查询通常探讨全球背景,以产生最终的预测,从而造成冗余负担和不真实的结果;更具体地说,查询通常对不同规模和立场的对象负责,这是查询本身的挑战,并将在查询之间造成空间资源竞争;为缓解这一问题,我们建议DETR小组利用查询协作和定位限制来更准确地包容感兴趣的对象;我们还积极满足每个查询成员的预测偏好,提供更佳的查询尺度和空间前科;此外,提议的DETR小组具有足够的灵活性,足以适应现有的其他DETR变式,而不增加参数和计算;COCO数据集的广泛实验表明,DETR小组取得了显著的成果,特别是对小型和大型物体而言。 代码可在\url{https://github.com/horrib-dong/TeamDETR}查阅。