Recent advances in natural language processing have yielded many exciting developments in text analysis and language understanding; however, these models can also be used to track people, raising severe privacy concerns. In this work, we investigate what individuals can do to avoid being detected by such models while using social media platforms. We ground our investigation in two exposure-risk tasks: stance detection and geotagging. We explore a variety of simple techniques for modifying text, such as inserting typos into salient words, paraphrasing, and adding dummy social media posts. Our experiments show that the performance of BERT-based models fine-tuned for stance detection decreases significantly due to typos, but is not affected by paraphrasing. Moreover, we find that typos have minimal impact on state-of-the-art geotagging models because of their increased reliance on social networks; however, we show that users can still deceive those models by interacting with different users, reducing their performance by almost 50%.
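To make the typo-insertion technique concrete, the following is a minimal illustrative sketch (not the paper's actual implementation): it swaps adjacent characters in words assumed to be salient to the classifier. The `salient` word set, the `insert_typo` and `perturb_post` helpers, and the attribution method used to pick salient words are all hypothetical assumptions for illustration.

```python
import random


def insert_typo(word: str, rng: random.Random) -> str:
    """Swap two adjacent characters to simulate a plausible typo."""
    if len(word) < 3:
        return word
    i = rng.randrange(len(word) - 1)
    chars = list(word)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


def perturb_post(text: str, salient_words: set, seed: int = 0) -> str:
    """Insert typos only into tokens flagged as salient for the model."""
    rng = random.Random(seed)
    return " ".join(
        insert_typo(tok, rng) if tok.lower().strip(".,!?") in salient_words else tok
        for tok in text.split()
    )


if __name__ == "__main__":
    post = "I strongly support the new climate policy proposed yesterday."
    # Hypothetical salient words, e.g. identified via gradient or attention attribution.
    salient = {"support", "climate", "policy"}
    print(perturb_post(post, salient))
```

In practice, which words count as salient would depend on the target model (e.g., words with high attribution scores for the stance classifier), and the perturbation should remain readable to humans while degrading the model's predictions.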