Facial forgery by deepfakes has raised severe societal concerns. Several solutions have been proposed by the vision community to effectively combat the misinformation on the internet via automated deepfake detection systems. Recent studies have demonstrated that facial analysis-based deep learning models can discriminate based on protected attributes. For the commercial adoption and massive roll-out of the deepfake detection technology, it is vital to evaluate and understand the fairness (the absence of any prejudice or favoritism) of deepfake detectors across demographic variations such as gender and race. As the performance differential of deepfake detectors between demographic subgroups would impact millions of people of the deprived sub-group. This paper aims to evaluate the fairness of the deepfake detectors across males and females. However, existing deepfake datasets are not annotated with demographic labels to facilitate fairness analysis. To this aim, we manually annotated existing popular deepfake datasets with gender labels and evaluated the performance differential of current deepfake detectors across gender. Our analysis on the gender-labeled version of the datasets suggests (a) current deepfake datasets have skewed distribution across gender, and (b) commonly adopted deepfake detectors obtain unequal performance across gender with mostly males outperforming females. Finally, we contributed a gender-balanced and annotated deepfake dataset, GBDF, to mitigate the performance differential and to promote research and development towards fairness-aware deep fake detectors. The GBDF dataset is publicly available at: https://github.com/aakash4305/GBDF
翻译:深假的假冒引起了严重的社会关注。视觉界提出了若干解决方案,以通过自动深假检测系统有效打击互联网上的错误信息。最近的研究表明,基于面部分析的深层学习模式可以基于受保护的属性进行歧视。为了商业上采用和大规模推出深假检测技术,必须评估和理解深假探测器在性别与种族等人口差异中的公平性(没有任何偏见或偏向性)问题。由于深假探测器在人口分组之间的性能差异将影响数百万贫困分组的人。本文旨在评估深假探测器对男性和女性的公平性能。然而,现有的深假数据集并没有附加人口标签以促进公平性分析。为此,我们手动用性别标签来说明现有流行的深假死数据集,并评估当前性别深度假死探测器的性能差异。我们用性别标签的数据集分析显示:(a)当前深底底底基数据集的性能分析将降低性别公平性能,我们通常通过性别-深底底色数据促进性别-高压数据。我们通常通过性别-深底底底底基数据促进性别-性别-高压数据。