This report introduces the technical details of the team FuXi-Fresher for LVIS Challenge 2021. Our method focuses on the problem in following two aspects: the long-tail distribution and the segmentation quality of mask and boundary. Based on the advanced HTC instance segmentation algorithm, we connect transformer backbone(Swin-L) through composite connections inspired by CBNetv2 to enhance the baseline results. To alleviate the problem of long-tail distribution, we design a Distribution Balanced method which includes dataset balanced and loss function balaced modules. Further, we use a Mask and Boundary Refinement method composed with mask scoring and refine-mask algorithms to improve the segmentation quality. In addition, we are pleasantly surprised to find that early stopping combined with EMA method can achieve a great improvement. Finally, by using multi-scale testing and increasing the upper limit of the number of objects detected per image, we achieved more than 45.4% boundary AP on the val set of LVIS Challenge 2021. On the test data of LVIS Challenge 2021, we rank 1st and achieve 48.1% AP. Notably, our APr 47.5% is very closed to the APf 48.0%.
翻译:本报告介绍FuXi-FresherLVIS 挑战2021小组的技术细节。我们的方法侧重于以下两个方面的问题:蒙面和边界的长尾分布和分解质量。根据先进的HTC例分化算法,我们通过CBNetv2的复合连接连接变压器主干网(Swin-L)以加强基线结果。为了减轻长尾分布问题,我们设计了一种分配平衡平衡法,其中包括数据集平衡和损耗功能模块。此外,我们用蒙面和边界精细微算法构成的面罩和边界精细化方法来提高分解质量。此外,我们很惊讶地发现,与EMA方法的早期停止结合能够取得很大的改进。最后,通过多尺度测试和增加每个图像所检测对象的上限,我们在LVIS 挑战2021年的val集中实现了超过45.4%的边界AP。关于LVIS挑战2021的测试数据,我们排在1级和48.1%的AP.0至48.1%的APr%已经关闭。