POI-level geo-information of social posts is critical to many location-based applications and services. However, the multi-modality, complexity and diverse nature of social media data and their platforms limit the performance of inferring such fine-grained locations and their subsequent applications. To address this issue, we present a transformer-based general framework, which builds upon pre-trained language models and considers non-textual data, for social post geolocation at the POI level. To this end, inputs are categorized to handle different social data, and an optimal combination strategy is provided for feature representations. Moreover, a uniform representation of hierarchy is proposed to learn temporal information, and a concatenated version of encodings is employed to capture feature-wise positions better. Experimental results on various social datasets demonstrate that three variants of our proposed framework outperform multiple state-of-art baselines by a large margin in terms of accuracy and distance error metrics.
翻译:社会职位的POI级地理信息对于许多基于地点的应用和服务至关重要,然而,社交媒体数据及其平台的多模式、复杂性和多样性限制了推算这类精细地段及其随后应用的性能。为解决这一问题,我们提出了一个基于变压器的一般框架,该框架以预先培训的语言模型为基础,并考虑非文字数据,用于在POI级的社会职位定位。为此,对投入进行了分类,以处理不同的社会数据,并为特征表现提供了最佳组合战略。此外,还提议了一种统一的等级代表制,以学习时间信息,并采用了一种分类编码版本,以更好地捕捉地貌位置。各种社会数据集的实验结果表明,我们拟议框架的三种变式在准确度和距离误差度衡量尺度上大大超越了多个最先进的基线。