Extremism research has grown as an open problem for several countries during recent years, especially due to the apparition of movements such as jihadism. This and other extremist groups have taken advantage of different approaches, such as the use of Social Media, to spread their ideology, promote their acts and recruit followers. Natural Language Processing (NLP) represents a way of detecting this type of content, and several authors make use of it to describe and discriminate the discourse held by this groups, with the final objective of detecting and preventing its spread. This survey aims to review the contributions of NLP to the field of extremism research, providing the reader with a comprehensive picture of the state of the art of this research area. The content includes a description and comparison of the frequently used NLP techniques, how they were applied, the insights they provided, the most frequently used NLP software tools and the availability of datasets and data sources for research. Finally, research questions are approached and answered with highlights from the review, while future trends, challenges and directions derived from these highlights are suggested.
翻译:近年来,极端主义研究已成为若干国家的一个公开问题,特别是由于圣战等运动的外观,这种研究和其他极端主义团体利用各种办法,例如利用社会媒体传播其意识形态、宣传其行为和招募追随者。自然语言处理(NLP)是检测这类内容的一种方式,一些作者利用它来描述和歧视这些团体的言论,最终目的是发现和防止其扩散。这次调查的目的是审查全国语言方案对极端主义研究领域的贡献,向读者全面介绍这一研究领域的艺术状况。内容包括描述和比较经常使用的全国语言方案技术、应用方式、提供的见解、最常用的NLP软件工具以及研究的数据集和数据来源。最后,研究问题通过审查的要点进行探讨和回答,同时提出来自这些要点的未来趋势、挑战和方向。