We introduce the problem of proficiency modeling: Given a user's posts on a social media platform, the task is to identify the subset of posts or topics for which the user has some level of proficiency. This enables the filtering and ranking of social media posts on a given topic as per user proficiency. Unlike experts on a given topic, proficient users may not have received formal training and possess years of practical experience, but may be autodidacts, hobbyists, and people with sustained interest, enabling them to make genuine and original contributions to discourse. While predicting whether a user is an expert on a given topic imposes strong constraints on who is a true positive, proficiency modeling implies a graded scoring, relaxing these constraints. Put another way, many active social media users can be assumed to possess, or eventually acquire, some level of proficiency on topics relevant to their community. We tackle proficiency modeling in an unsupervised manner by utilizing user embeddings to model engagement with a given topic, as indicated by a user's preference for authoring related content. We investigate five alternative approaches to model proficiency, ranging from basic ones to an advanced, tailored user modeling approach, applied within two real-world benchmarks for evaluation.
翻译:我们引入了熟练程度模型问题:鉴于用户在社交媒体平台上的职位,我们的任务是确定用户具有一定熟练程度的职位或专题的子集,从而能够按照用户熟练程度筛选和排列某一专题的社会媒体职位。与特定专题的专家不同,熟练的用户可能没有接受过正式培训,并拥有多年的实际经验,但可能是自成一体的行为、业余爱好者和具有持续兴趣的人,使他们能够对讨论作出真正和原始的贡献。在预测用户是否是某一专题的专家时,对谁真正是积极的,熟练程度模型意味着分级,放松这些限制。另外,许多活跃的社会媒体用户可以假定拥有或最终获得与其社区有关的专题的熟练程度。我们通过用户倾向于编写相关内容来将熟练程度模型纳入某一专题的模型,从而以不受监督的方式处理熟练程度模型问题。我们调查了在现实世界范围内应用的从基本标准到高级用户定制的用户模型模型评估的五种替代方法。