Cyberbullying, identified as intended and repeated online bullying behavior, has become increasingly prevalent in the past few decades. Despite the significant progress made thus far, the focus of most existing work on cyberbullying detection lies in the independent content analysis of different comments within a social media session. We argue that such leading notions of analysis suffer from three key limitations: they overlook the temporal correlations among different comments; they only consider the content within a single comment rather than the topic coherence across comments; they remain generic and exploit limited interactions between social media users. In this work, we observe that user comments in the same session may be inherently related, e.g., discussing similar topics, and their interaction may evolve over time. We also show that modeling such topic coherence and temporal interaction are critical to capture the repetitive characteristics of bullying behavior, thus leading to better predicting performance. To achieve the goal, we first construct a unified temporal graph for each social media session. Drawing on recent advances in graph neural network, we then propose a principled graph-based approach for modeling the temporal dynamics and topic coherence throughout user interactions. We empirically evaluate the effectiveness of our approach with the tasks of session-level bullying detection and comment-level case study. Our code is released to public.
翻译:网络欺凌被确定为有意和反复的网上欺凌行为,在过去几十年中越来越普遍。尽管迄今为止取得了显著进展,但大多数关于网络欺凌探测的现有工作重心在于对社交媒体会议内不同评论的独立内容分析。我们认为,这种主要的分析概念有三大局限性:它们忽略了不同评论之间的时间相关性;它们只考虑单一评论中的内容,而不是不同评论之间的一致性问题;它们仍然是通用的,利用社交媒体用户之间的有限互动。在这项工作中,我们注意到,同届会议中的用户评论可能具有内在联系,例如讨论类似的议题,而且它们的互动可能随着时间的推移而演变。我们还表明,模拟这种主题的一致性和时间互动对于捕捉欺凌行为的重复性特征至关重要,从而导致更好地预测业绩。为了实现这一目标,我们首先为每个社交媒体会议建立一个统一的时间图。利用图表神经网络的最新进展,我们然后提出一个基于图表的有原则的方法,用以模拟整个用户互动中的时间动态和主题的一致性。我们从经验上评估我们的方法在届会一级进行欺凌人行为探测和评论的案例研究的有效性。