Human skin segmentation is a crucial task in computer vision and biometric systems, yet it poses several challenges such as variability in skin color, pose, and illumination. This paper presents a robust data-driven skin segmentation method for a single image that addresses these challenges through the integration of contextual information and efficient network design. In addition to robustness and accuracy, the integration into real-time systems requires a careful balance between computational power, speed, and performance. The proposed method incorporates two attention modules, Body Attention and Skin Attention, that utilize contextual information to improve segmentation results. These modules draw attention to the desired areas, focusing on the body boundaries and skin pixels, respectively. Additionally, an efficient network architecture is employed in the encoder part to minimize computational power while retaining high performance. To handle the issue of noisy labels in skin datasets, the proposed method uses a weakly supervised training strategy, relying on the Skin Attention module. The results of this study demonstrate that the proposed method is comparable to, or outperforms, state-of-the-art methods on benchmark datasets.
翻译:人类皮肤分解是计算机视觉和生物鉴别系统的一项关键任务,但它提出了若干挑战,例如肤色、面貌和光化的变异性。本文件为通过整合背景信息和高效网络设计来应对这些挑战的单一图像提供了一个强有力的数据驱动的皮肤分解方法。除了坚固性和准确性外,融入实时系统还需要在计算能力、速度和性能之间保持谨慎的平衡。拟议方法包含两个关注模块,即身体注意和皮肤注意,利用背景信息改善分解结果。这些模块吸引人们注意所需区域,分别侧重于身体边界和皮肤像素。此外,在编码部分采用了高效的网络结构,以最大限度地减少计算能力,同时保持高性能。在处理皮肤数据集中的噪音标签问题时,拟议方法使用一种监管薄弱的培训策略,依靠皮肤注意模块。这项研究的结果表明,拟议方法与基准数据集的近似或超出常规方法。