Language evolves over time, and word meanings change accordingly. This is especially true in social media, whose dynamic nature leads to faster semantic shifts, making it challenging for NLP models to deal with new content and trends. However, datasets and models that specifically address the dynamic nature of these platforms are scarce. To bridge this gap, we present TempoWiC, a new benchmark aimed at accelerating research on meaning shift in social media. Our results show that TempoWiC is a challenging benchmark, even for recently released language models specialized in social media.