Customer behavior is often assumed to follow weak rationality, which implies that adding a product to an assortment will not increase the choice probability of another product in that assortment. However, an increasing amount of research has revealed that customers are not necessarily rational when making decisions. In this paper, we propose a new nonparametric choice model that relaxes this assumption and can model a wider range of customer behavior, such as decoy effects between products. In this model, each customer type is associated with a binary decision tree, which represents a decision process for making a purchase based on checking for the existence of specific products in the assortment. Together with a probability distribution over customer types, we show that the resulting model -- a decision forest -- is able to represent any customer choice model, including models that are inconsistent with weak rationality. We theoretically characterize the depth of the forest needed to fit a data set of historical assortments and prove that with high probability, a forest whose depth scales logarithmically in the number of assortments is sufficient to fit most data sets. We also propose two practical algorithms -- one based on column generation and one based on random sampling -- for estimating such models from data. Using synthetic data and real transaction data exhibiting non-rational behavior, we show that the model outperforms both rational and non-rational benchmark models in out-of-sample predictive ability.
翻译:客户行为通常被假定为遵循薄弱的合理性,这意味着将产品添加到各种产品中不会增加其他产品的选择概率。 然而,越来越多的研究显示,客户在决策时不一定具有理性。 在本文中,我们提出一个新的非参数选择模式,放松这一假设,并可以模拟更广泛的客户行为,例如产品之间的诱饵效应。在这个模型中,每种客户类型都与二进制决策树相关联,这代表了在检查各种产品是否存在的基础上进行购买的决定程序。加上客户类型之间的概率分布,我们表明,所产生的模型 -- -- 一种决策森林 -- -- 能够代表任何客户选择模式,包括与薄弱理性不相符的模型。我们从理论上描述森林的深度,以适应一套历史类比效应数据集,并证明在高概率下,一个其深度水平对准的森林足以适应大多数数据集。我们还提议两种实用的能力算法 -- -- 一种基于模型,即决策森林选择森林选择模式 -- -- -- 一种基于模型生成的模型,一种基于不可靠数据,一种基于不可靠的模型,一种基于不可靠的模拟的模拟数据,一种基于不可靠数据,一种基于不可靠的模拟的模拟数据。