Creating a taxonomy of interests is expensive and human-effort intensive: not only do we need to identify nodes and interconnect them, in order to use the taxonomy, we must also connect the nodes to relevant entities such as users, pins, and queries. Connecting to entities is challenging because of ambiguities inherent to language but also because individual interests are dynamic and evolve. Here, we offer an alternative approach that begins with bottom-up discovery of $\mu$-topics called pincepts. The discovery process itself connects these $\mu$-topics dynamically with relevant queries, pins, and users at high precision, automatically adapting to shifting interests. Pincepts cover all areas of user interest and automatically adjust to the specificity of user interests and are thus suitable for the creation of various kinds of taxonomies. Human experts associate taxonomy nodes with $\mu$-topics (on average, 3 $\mu$-topics per node), and the $\mu$-topics offer a high-level data layer that allows quick definition, immediate inspection, and easy modification. Even more powerfully, $\mu$-topics allow easy exploration of nearby semantic space, enabling curators to spot and fill gaps. Curators' domain knowledge is heavily leveraged and we thus don't need untrained mechanical Turks, allowing further cost reduction. These $\mu$-topics thus offer a satisfactory "symbolic" stratum over which to define taxonomies. We have successfully applied this technique for very rapidly iterating on and launching the home decor and fashion styles taxonomy for style-based personalization, prominently featured at the top of Pinterest search results, at 94% precision, improving search success rate by 34.8% as well as boosting long clicks and pin saves.
翻译:创建利益分类是昂贵的,也是人为努力的密集:我们不仅需要识别节点和连接节点,而且为了使用分类,我们还必须将这些节点与用户、针头和查询等相关实体连接起来。与实体连接具有挑战性,因为语言本身含混不清,但也因为个人利益是动态和进化的。在这里,我们提供一种替代方法,从自下发现$mu美元这个叫做尖点开始。发现过程本身将这些 $mu美元 主题与相关查询、针头和用户动态连接起来,以便使用高精确度、自动适应变化的利益。感应覆盖用户感兴趣的所有领域,并自动适应用户兴趣的特殊性。因此,与实体连接起来是具有挑战性的,因为语言本身的模糊性,也因为个人利益是动态和进化的。 人类专家将分类节点与$mu(平均,3美元mu$mu美元)的节点联系起来,我们通过节点来填补一个高层次的数据层, 用于快速的搜索、即时时态和易变换的货币。因此,让我们能够快速地在上进行内部的探索。因此, 需要一个更深入的节点。