Cartoon face detection is a more challenging task than human face detection due to many difficult scenarios is involved. Aiming at the characteristics of cartoon faces, such as huge differences within the intra-faces, in this paper, we propose an asymmetric cartoon face detector, named ACFD. Specifically, it consists of the following modules: a novel backbone VoVNetV3 comprised of several asymmetric one-shot aggregation modules (AOSA), asymmetric bi-directional feature pyramid network (ABi-FPN), dynamic anchor match strategy (DAM) and the corresponding margin binary classification loss (MBC). In particular, to generate features with diverse receptive fields, multi-scale pyramid features are extracted by VoVNetV3, and then fused and enhanced simultaneously by ABi-FPN for handling the faces in some extreme poses and have disparate aspect ratios. Besides, DAM is used to match enough high-quality anchors for each face, and MBC is for the strong power of discrimination. With the effectiveness of these modules, our ACFD achieves the 1st place on the detection track of 2020 iCartoon Face Challenge under the constraints of model size 200MB, inference time 50ms per image, and without any pretrained models.
翻译:由于许多困难的情景,卡通面部检测是一项比人类面部检测更具挑战性的任务。在本文中,我们针对卡通面孔的特征,例如脸部内部的巨大差异,提出了名为ACFD的不对称卡通脸探测器。具体地说,它由以下模块组成:由若干不对称单发集成模块(AOSA)、双向双向特征金字塔网络(ABi-FPN)、动态锚比战略(DAM)和相应的差值二进制损失(MBC)组成的新的骨干VVOVNetVNetV3,然后由ABI-FDN同时结合和强化,用于处理某些极端面部位的脸部和有差异的方面比率。此外,DAM用来匹配每个脸部的足够高质量的锚,MBC是强大的歧视力量。由于这些模块的有效性,我们ACDF在2020年iCartoon Face的探测轨道上取得了第一个位置,由VVNVNetV3提取,然后由ABI-FN在模型规模200MB的制约下,在50号前挑战下,在50岁前任何时间。