Face parsing aims to predict pixel-wise labels for facial components of a target face in an image. Existing approaches usually crop the target face from the input image with respect to a bounding box calculated during pre-processing, and thus can only parse inner facial Regions of Interest (RoIs). Peripheral regions like hair are ignored and nearby faces that are partially included in the bounding box can cause distractions. Moreover, these methods are only trained and evaluated on near-frontal portrait images and thus their performance for in-the-wild cases were unexplored. To address these issues, this paper makes three contributions. First, we introduce iBugMask dataset for face parsing in the wild containing 1,000 manually annotated images with large variations in sizes, poses, expressions and background, and Helen-LP, a large-pose training set containing 21,866 images generated using head pose augmentation. Second, we propose RoI Tanh-polar transform that warps the whole image to a Tanh-polar representation with a fixed ratio between the face area and the context, guided by the target bounding box. The new representation contains all information in the original image, and allows for rotation equivariance in the convolutional neural networks (CNNs). Third, we propose a hybrid residual representation learning block, coined HybridBlock, that contains convolutional layers in both the Tanh-polar space and the Tanh-Cartesian space, allowing for receptive fields of different shapes in CNNs. Through extensive experiments, we show that the proposed method significantly improves the state-of-the-art for face parsing in the wild.
翻译:面部剖析的目的是预测目标图像面部部分面部的像素标签。 现有方法通常从预处理期间计算成的捆绑框中输入图像, 将目标面部从输入图像中切除出来, 因而只能剖析内部面部区域( Rois ) 。 围框中部分包含毛发的周边区域被忽略, 附近面部可引起分心。 此外, 这些方法仅在近前侧肖像上进行训练和评价, 因而它们对于圆形中的案件的性能是未解析的。 为了解决这些问题, 本文做出三项贡献。 首先, 我们引入 iBugMask 数据集用于在野外进行面面部切切换, 包含1,000张手动的附加图象, 在大小、 面部、 表达和背景上, 海伦- LP, 包含21 866 张图像的大型值训练组。 其次, 我们提议罗塔- 极极极图将整张面部图像转换成坦氏平面面部图, 在脸部和上设定一个固定的状态, 由目标平面平面平面图层指导。 将ibal- 引入, 在图层图层中, 将新图层图中, 展示显示一个原始平面层图层图, 将所有。