Conceptual graphs, which is a particular type of Knowledge Graphs, play an essential role in semantic search. Prior conceptual graph construction approaches typically extract high-frequent, coarse-grained, and time-invariant concepts from formal texts. In real applications, however, it is necessary to extract less-frequent, fine-grained, and time-varying conceptual knowledge and build taxonomy in an evolving manner. In this paper, we introduce an approach to implementing and deploying the conceptual graph at Alibaba. Specifically, We propose a framework called AliCG which is capable of a) extracting fine-grained concepts by a novel bootstrapping with alignment consensus approach, b) mining long-tail concepts with a novel low-resource phrase mining approach, c) updating the graph dynamically via a concept distribution estimation method based on implicit and explicit user behaviors. We have deployed the framework at Alibaba UC Browser. Extensive offline evaluation as well as online A/B testing demonstrate the efficacy of our approach.
翻译:概念图是一种特殊类型的知识图,在语义搜索中起着关键作用。 先前的概念图构建方法通常会从正式文本中提取高频、粗粗和时间变化的概念。 但是,在实际应用中,有必要以不断演变的方式提取较少频、细微和时间变化的概念学和构建分类学。 在本文中,我们引入了在阿里巴巴实施和部署概念图的方法。 具体地说,我们提议了一个称为AliCG的框架,它能够通过一种与一致的共识方法相提并论地提取精细的精细概念,b) 利用一种新的低资源短语采矿方法挖掘长尾概念,c) 通过基于隐含和明确的用户行为的概念分布估计方法动态地更新图形。 我们在阿里巴巴·UC浏览器采用了框架。 广泛的离线评估以及在线A/B测试显示了我们方法的功效。