We seek to provide an interpretable framework for segmenting users in a population for personalized decision-making. We propose a general methodology, Market Segmentation Trees (MSTs), for learning market segmentations explicitly driven by identifying differences in user response patterns. To demonstrate the versatility of our methodology, we design two new, specialized MST algorithms: (i) Choice Model Trees (CMTs), which can be used to predict a user's choice amongst multiple options and (ii) Isotonic Regression Trees (IRTs), which can be used to solve the bid landscape forecasting problem. We provide a theoretical analysis of the asymptotic running times of our algorithmic methods, which validates their computational tractability on large datasets. We also provide a customizable, open-source code base for training MSTs in Python which employs several strategies for scalability, including parallel processing and warm starts. Finally, we assess the practical performance of MSTs on several synthetic and real world datasets, showing that our method reliably finds market segmentations which accurately model response behavior.
翻译:我们试图为人口中的用户提供可解释的框架,以便进行个性化决策。我们提出了一个一般方法,即市场分割树(MSTs),用于学习由用户反应模式差异明确驱动的市场分割。为了显示我们方法的多功能性,我们还设计了两种新的、专门的MST算法:(一) 选择模型树(CMTs),可用于预测用户在多个选项中的选择;(二) 同位素回归树(IRTs),可用于解决投标景观预测问题。我们提供了一种理论分析,分析我们算法方法的无规律运行时间,以验证其在大型数据集上的计算可移动性。我们还提供了一个可定制的、开放源代码基础,用于培训Python的MSTs,该模型采用几种可缩放战略,包括平行处理和热应启动。最后,我们评估了几个合成和真实世界数据集中MSTs的实际表现,表明我们的方法可靠地发现市场分割了准确的模型反应行为。