We seek to provide an interpretable framework for segmenting users in a population for personalized decision-making. We propose a general methodology, Market Segmentation Trees (MSTs), for learning market segmentations explicitly driven by identifying differences in user response patterns. To demonstrate the versatility of our methodology, we design two new, specialized MST algorithms: (i) Choice Model Trees (CMTs), which can be used to predict a user's choice amongst multiple options and (ii) Isotonic Regression Trees (IRTs), which can be used to solve the bid landscape forecasting problem. We provide a theoretical analysis of the asymptotic running times of our algorithmic methods, which validates their computational tractability on large datasets. We also provide a customizable, open-source code base for training MSTs in Python which employs several strategies for scalability, including parallel processing and warm starts. Finally, we assess the practical performance of MSTs on several synthetic and real world datasets, showing that our method reliably finds market segmentations which accurately model response behavior. Moreover, MSTs are interpretable since the market segments can easily be described by a decision tree and often require only a fraction of the number of market segments generated by traditional approaches.
翻译:我们试图为人口中的用户提供可解释的分类框架,以便进行个性化决策。我们提出了一个一般方法,即市场分割树(MSTs),用于学习由识别用户反应模式差异明确驱动的市场分割。为了显示我们方法的多功能性,我们还设计了两种新的、专门的MST算法:(一) 选择模型树(CMTs),可用于预测用户在多种选项中的选择;(二) 同位素回归树(IRTs),可用于解决投标景观预测问题。我们提供了一种理论分析,分析我们算法方法的缓慢运行时间,以验证其在大型数据集中的计算可移动性。我们还设计了一个可定制的、开放源代码基础,用于培训Python的MSTs,该模型可采用几种可缩放战略,包括平行处理和热启动。最后,我们评估了几个合成和真实世界数据集中MSTs的实际表现,表明我们的方法可以可靠地找到准确的市场分割模式反应行为。此外,MSTs常常需要按市场小段来解释市场决策。