Decision Forest: A Nonparametric Approach to Modeling Irrational Choice

Customer behavior is often assumed to follow weak rationality, which implies that adding a product to an assortment will not increase the choice probability of another product in that assortment. However, an increasing amount of research has revealed that customers are not necessarily rational when making decisions. In this paper, we propose a new nonparametric choice model that relaxes this assumption and can model a wider range of customer behavior, such as decoy effects between products. In this model, each customer type is associated with a binary decision tree, which represents a decision process for making a purchase based on checking for the existence of specific products in the assortment. Together with a probability distribution over customer types, we show that the resulting model -- a decision forest -- is able to represent any customer choice model, including models that are inconsistent with weak rationality. We theoretically characterize the depth of the forest needed to fit a data set of historical assortments and prove that with high probability, a forest whose depth scales logarithmically in the number of assortments is sufficient to fit most data sets. We also propose two practical algorithms -- one based on column generation and one based on random sampling -- for estimating such models from data. Using synthetic data and real transaction data exhibiting non-rational behavior, we show that the model outperforms both rational and non-rational benchmark models in out-of-sample predictive ability.
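To make the model concrete, here is a minimal sketch of a decision forest in Python. It illustrates the idea described in the abstract and is not the authors' implementation. The class names (`Leaf`, `Split`), the function names, the product labels, and the weights are all assumptions chosen for the example. Each customer type is a binary tree whose internal nodes check whether a product is in the assortment, and the forest is a list of trees paired with probabilities.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Tuple, Union

NO_PURCHASE = 0  # assumed label for the no-purchase option


@dataclass(frozen=True)
class Leaf:
    choice: int  # product bought at this leaf, or NO_PURCHASE


@dataclass(frozen=True)
class Split:
    product: int        # product whose presence in the assortment is checked
    if_present: "Node"  # branch followed when the product is offered
    if_absent: "Node"   # branch followed when it is not offered


Node = Union[Leaf, Split]
Forest = List[Tuple[Node, float]]  # (tree, probability of that customer type)


def choose(tree: Node, assortment: FrozenSet[int]) -> int:
    """Walk one customer type's tree and return the purchase decision."""
    node = tree
    while isinstance(node, Split):
        node = node.if_present if node.product in assortment else node.if_absent
    return node.choice


def choice_probability(forest: Forest, product: int, assortment: FrozenSet[int]) -> float:
    """Total weight of the customer types that pick `product` from `assortment`."""
    return sum(w for tree, w in forest if choose(tree, assortment) == product)


# Hypothetical products: 1 = target, 2 = competitor, 3 = decoy.
# Type A prefers product 2, but switches to product 1 when the decoy is offered.
type_a = Split(
    1,
    if_present=Split(3, if_present=Leaf(1), if_absent=Split(2, Leaf(2), Leaf(1))),
    if_absent=Split(2, Leaf(2), Leaf(NO_PURCHASE)),
)
# Type B buys product 1 whenever it is offered.
type_b = Split(1, Leaf(1), Leaf(NO_PURCHASE))

forest: Forest = [(type_a, 0.5), (type_b, 0.5)]

without_decoy = choice_probability(forest, 1, frozenset({1, 2}))
with_decoy = choice_probability(forest, 1, frozenset({1, 2, 3}))
print(without_decoy, with_decoy)  # 0.5 1.0
```

In this toy forest, adding product 3 raises the choice probability of product 1 from 0.5 to 1.0. That is the kind of decoy effect a model obeying weak rationality cannot represent.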

