以文字嵌入形式攻击定级定级等级:招聘定级案例研究 (Attacks against Ranking Algorithms with Text Embeddings: a Case Study on Recruitment Algorithms)

Recently, some studies have shown that text classification tasks are vulnerable to poisoning and evasion attacks. However, little work has investigated attacks against decision making algorithms that use text embeddings, and their output is a ranking. In this paper, we focus on ranking algorithms for recruitment process, that employ text embeddings for ranking applicants resumes when compared to a job description. We demonstrate both white box and black box attacks that identify text items, that based on their location in embedding space, have significant contribution in increasing the similarity score between a resume and a job description. The adversary then uses these text items to improve the ranking of their resume among others. We tested recruitment algorithms that use the similarity scores obtained from Universal Sentence Encoder (USE) and Term Frequency Inverse Document Frequency (TF IDF) vectors. Our results show that in both adversarial settings, on average the attacker is successful. We also found that attacks against TF IDF is more successful compared to USE.

翻译：最近,一些研究显示,文本分类任务容易受到毒害和规避攻击;然而,几乎没有调查攻击使用文本嵌入器的决策算法的事件,其产出是排名。在本文件中,我们侧重于征聘过程的排名算法,即采用排名申请人的文本嵌入器,与职务说明相比,恢复了排名申请人的排名算法。我们展示了白箱和黑盒袭击,以文字项目在嵌入空间的位置为基础,对提高简历和职务说明之间的相似性分数作出了重大贡献。对手然后使用这些文本项目来改进复出品的排名等。我们测试了从通用判刑编码器和Term Riod River Document Orence Henters 获得的相似性分数的征聘算法。我们的结果显示,在两种对抗环境中,攻击者平均都获得成功。我们还发现,对TF 以色列国防军的攻击比USE更成功。