The major aim of this paper is to explain the data poisoning attacks using label-flipping during the training stage of the electroencephalogram (EEG) signal-based human emotion evaluation systems deploying Machine Learning models from the attackers' perspective. Human emotion evaluation using EEG signals has consistently attracted a lot of research attention. The identification of human emotional states based on EEG signals is effective to detect potential internal threats caused by insider individuals. Nevertheless, EEG signal-based human emotion evaluation systems have shown several vulnerabilities to data poison attacks. The findings of the experiments demonstrate that the suggested data poison assaults are model-independently successful, although various models exhibit varying levels of resilience to the attacks. In addition, the data poison attacks on the EEG signal-based human emotion evaluation systems are explained with several Explainable Artificial Intelligence (XAI) methods, including Shapley Additive Explanation (SHAP) values, Local Interpretable Model-agnostic Explanations (LIME), and Generated Decision Trees. And the codes of this paper are publicly available on GitHub.
翻译:本文的主要目的是解释在电子脑图(EEG)培训阶段使用基于信号的人类情感评价系统,从攻击者的角度运用机器学习模型,使用EEG信号的人类情感评价一贯引起许多研究关注。根据EEG信号识别人类情感状态对于发现内人可能造成的内部威胁是有效的。然而,基于EEG信号的人类情感评价系统显示,数据毒物攻击存在若干弱点。实验结果表明,建议的数据毒物攻击是独立的模型成功的,尽管各种模型显示对攻击具有不同程度的复原力。此外,对EEG基于信号的人类情感评价系统进行的数据毒物攻击还用几种可解释的人工智能(XAI)方法加以解释,这些方法包括:皮肤解释(SHAP)值、局部间解释模型(LIME)和基因决定树。本文的代码在GitHub上公布。