We consider a class of assortment optimization problems in an offline data-driven setting. A firm does not know the underlying customer choice model but has access to an offline dataset consisting of the historically offered assortment set, customer choice, and revenue. The objective is to use the offline dataset to find an optimal assortment. Due to the combinatorial nature of assortment optimization, the problem of insufficient data coverage is likely to occur in the offline dataset. Therefore, designing a provably efficient offline learning algorithm becomes a significant challenge. To this end, we propose an algorithm referred to as Pessimistic ASsortment opTimizAtion (PASTA for short) designed based on the principle of pessimism, that can correctly identify the optimal assortment by only requiring the offline data to cover the optimal assortment under general settings. In particular, we establish a regret bound for the offline assortment optimization problem under the celebrated multinomial logit model. We also propose an efficient computational procedure to solve our pessimistic assortment optimization problem. Numerical studies demonstrate the superiority of the proposed method over the existing baseline method.
翻译:在离线数据驱动环境下,我们考虑类类类类类的分类优化优化问题。一个公司并不了解基本的客户选择模式,但可以访问离线数据集,该数据集由历史提供的分类集、客户选择和收入组成。目标是利用离线数据集寻找最佳分类。由于分类优化的组合性质,离线数据集中可能会出现数据覆盖不足的问题。因此,设计一个可辨别高效的离线学习算法是一项重大挑战。为此,我们提议了一种基于悲观原则设计的离线数据集,该算法可以正确确定最佳分类,因为仅要求离线数据覆盖一般情况下的最佳分类。特别是,我们为在已知的多音调对数模型下离线的离线优化问题设定了遗憾。我们还建议了一种高效的计算程序,用以解决我们目前压倒性模型的磁性模型方法问题。我们还提议了一种高效的计算程序,用以证明目前磁性模型的磁性分析方法。