In recent years, Deepfake-based image and video manipulations have become a severe concern for security and society. Many detection models and datasets have been proposed to detect Deepfake data reliably. However, there is an increased concern that these models and training databases might be biased and thus cause Deepfake detectors to fail. In this work, we investigate the bias issue caused by public Deepfake datasets by (a) providing large-scale demographic and non-demographic attribute annotations of 47 different attributes for five popular Deepfake datasets and (b) comprehensively analysing the AI bias of three state-of-the-art Deepfake detection backbone models on these datasets. The investigation analyses the influence of a large variety of distinctive attributes (from over 65M labels) on the detection performance, including demographic (age, gender, ethnicity) and non-demographic (hair, skin, accessories, etc.) information. The results indicate that the investigated databases lack diversity and, more importantly, show that the utilised Deepfake detection backbone models are strongly biased towards many of the investigated attributes. Deepfake detection backbone methods trained on biased datasets might output incorrect detection results, thereby leading to generalisability, fairness, and security issues. We hope that the findings of this study and the annotation databases will help to evaluate and mitigate bias in future Deepfake detection techniques. The annotation datasets are publicly available.