Prior human parsing models are limited to parsing humans into classes pre-defined in the training data, which is not flexible to generalize to unseen classes, e.g., new clothing in fashion analysis. In this paper, we propose a new problem named one-shot human parsing (OSHP) that requires to parse human into an open set of reference classes defined by any single reference example. During training, only base classes defined in the training set are exposed, which can overlap with part of reference classes. In this paper, we devise a novel Progressive One-shot Parsing network (POPNet) to address two critical challenges , i.e., testing bias and small sizes. POPNet consists of two collaborative metric learning modules named Attention Guidance Module and Nearest Centroid Module, which can learn representative prototypes for base classes and quickly transfer the ability to unseen classes during testing, thereby reducing testing bias. Moreover, POPNet adopts a progressive human parsing framework that can incorporate the learned knowledge of parent classes at the coarse granularity to help recognize the descendant classes at the fine granularity, thereby handling the small sizes issue. Experiments on the ATR-OS benchmark tailored for OSHP demonstrate POPNet outperforms other representative one-shot segmentation models by large margins and establishes a strong baseline. Source code can be found at https://github.com/Charleshhy/One-shot-Human-Parsing.
翻译:人类先前的剖析模型仅限于将人类分为培训数据中预先界定的班级,而培训数据中则不灵活地将人类分为一般化为看不见的班级,例如,新服装和时装分析。在本文件中,我们提出了一个名为单向人类剖析的新问题,要求将人类剖析成一个由任何一个参考示例定义的开放的参考班(OSHP),在培训期间,仅将培训组中定义的基础班暴露,这可能与参考班部分重叠。在本文中,我们设计了一个新的进步一发分解网络(POPNet),以应对两个关键挑战,即测试偏向和小尺寸。POPNet由两个名为注意指导模块和近距离中心模块的两个协作性指标学习模块组成,这两个模块可以学习基础班的代表性模型,并在测试期间将能力迅速转移到隐蔽班,从而减少测试偏差。此外,POPNet采用了一个进步的人类分解框架,可以纳入在粗度颗粒的家长班所学的知识,以帮助在精细的OS坚固度上识别后裔班级,从而在基础化基础化的SOS-OB-OBER-S-S-imalimal-deal 版模型中,从而处理一个大底基级数据库级数据库级数据库。在基础化的模型中,通过一个基础化的底基段测试,以展示一个基础化的底部/底部/底基段演示。