This work studies the pure-exploration setting for the convex hull feasibility (CHF) problem where one aims to efficiently and accurately determine if a given point lies in the convex hull of means of a finite set of distributions. We give a complete characterization of the sample complexity of the CHF problem in the one-dimensional setting. We present the first asymptotically optimal algorithm called Thompson-CHF, whose modular design consists of a stopping rule and a sampling rule. In addition, we provide an extension of the algorithm that generalizes several important problems in the multi-armed bandit literature. Finally, we further investigate the Gaussian bandit case with unknown variances and address how the Thompson-CHF algorithm can be adjusted to be asymptotically optimal in this setting.
翻译:这项工作研究Chompex船体可行性(CHF)问题的纯勘探环境,目的是高效率和准确地确定某一点是否存在于一组有限分布手段的圆柱体中。我们对一维环境中的CHF问题抽样复杂性作了完整的描述。我们提出了第一个非现最佳算法,即Thompson-CHF,其模块设计包括停止规则和取样规则。此外,我们提供了一种算法的延伸,该算法概括了多武装土匪文献中的若干重要问题。最后,我们进一步调查Gausian土匪案,但差异不明,并探讨如何调整Thompson-CHF算法,使之在这种环境中尽可能优化。