Bayesian adaptive experimental design is a form of active learning, which chooses samples to maximize the information they give about uncertain parameters. Prior work has shown that other forms of active learning can suffer from active learning bias, where unrepresentative sampling leads to inconsistent parameter estimates. We show that active learning bias can also afflict Bayesian adaptive experimental design, depending on model misspecification. We develop an information-theoretic measure of misspecification, and show that worse misspecification implies more severe active learning bias. At the same time, model classes incorporating more "noise" - i.e., specifying higher inherent variance in observations - suffer less from active learning bias, because their predictive distributions are likely to overlap more with the true distribution. Finally, we show how these insights apply to a (simulated) preference learning experiment.
翻译:Bayesian适应性实验设计是一种积极的学习形式,它选择样本,以尽量扩大它们提供的关于不确定参数的信息。先前的工作已经表明,其他形式的积极学习可能会受到积极的学习偏差的影响,而缺乏代表性的抽样则导致参数估计不一致。我们表明,积极的学习偏差也会影响Bayesian适应性实验设计,这取决于模型的偏差。我们开发了一种信息理论分辨误差的测量方法,并表明更差的偏差意味着更严重的积极学习偏差。与此同时,包含更多“噪音”的模型班――即具体说明观测中更大的内在差异――在积极学习偏差中受害较少,因为它们的预测分布可能与真实分布发生更多重叠。最后,我们展示这些洞察力如何适用于(模拟的)偏好学习实验。