Internet based businesses and products (e.g. e-commerce, music streaming) are becoming more and more sophisticated every day with a lot of focus on improving customer satisfaction. A core way they achieve this is by providing customers with an easy access to their products by structuring them in catalogues using navigation bars and providing recommendations. We refer to these catalogues as product concepts, e.g. product categories on e-commerce websites, public playlists on music streaming platforms. These product concepts typically contain products that are linked with each other through some common features (e.g. a playlist of songs by the same artist). How they are defined in the backend of the system can be different for different products. In this work, we represent product concepts using database queries and tackle two learning problems. First, given sets of products that all belong to the same unknown product concept, we learn a database query that is a representation of this product concept. Second, we learn product concepts and their corresponding queries when the given sets of products are associated with multiple product concepts. To achieve these goals, we propose two approaches that combine the concepts of PU learning with Decision Trees and Clustering. Our experiments demonstrate, via a simulated setup for a music streaming service, that our approach is effective in solving these problems.
翻译:以互联网为基础的企业和产品(例如电子商务、音乐流转等)每天都越来越复杂,其重点是提高客户满意度。它们实现这一点的核心方式是,通过使用导航栏将消费者的商品编成目录,并提出建议,使他们更容易地获得产品。我们将这些目录称为产品概念,例如电子商务网站上的产品类别、音乐流平台上的公共播放列表。这些产品概念通常包含通过一些共同特点(例如同一艺术家的歌曲播放列表)相互联系的产品。在系统后端如何界定它们,对不同产品有不同的定义。在这项工作中,我们利用数据库查询和解决两个学习问题来代表产品概念。首先,如果产品都属于同一未知产品概念的一组产品,我们学习数据库查询,这是该产品概念的一种体现。第二,当给定的产品组合与多种产品概念相联系时,我们学习产品概念和相应的查询。为了实现这些目标,我们建议了两种方法,即将PU学习的概念与决策树和分组的概念结合起来,我们通过模拟这些实验来展示我们是如何解决这些问题的。