This paper establishes a central limit theorem under the assumption that conditional variances can vary in a largely unstructured history-dependent way across experiments subject only to the restriction that they lie in a fixed interval. Limits take a novel and tractable form, and are expressed in terms of oscillating Brownian motion. A second contribution is application of this result to a class of multi-armed bandit problems where the decision-maker is loss averse.
翻译:本文设定了一个核心限制理论,其假设是,有条件差异在各种实验之间可能发生基本上没有结构的、取决于历史的差别,但只能受限于它们处于固定间隔的限制。 限制具有新颖和可移动的形式,以摇动布朗运动的形式表达。 第二个贡献是,这一结果适用于决策者不愿失去的一类多武装土匪问题。