The problem of Sequential Estimation under Multiple Resources (SEMR) is defined in a federated setting. SEMR could be considered as the intersection of statistical estimation and bandit theory. In this problem, an agent is confronting with k resources to estimate a parameter $\theta$. The agent should continuously learn the quality of the resources by wisely choosing them and at the end, proposes an estimator based on the collected data. In this paper, we assume that the resources' distributions are Gaussian. The quality of the final estimator is evaluated by its mean squared error. Also, we restrict our class of estimators to unbiased estimators in order to define a meaningful notion of regret. The regret measures the performance of the agent by the variance of the final estimator in comparison to the optimal variance. We propose a lower bound to determine the fundamental limit of the setting even in the case that the distributions are not Gaussian. Also, we offer an order-optimal algorithm to achieve this lower bound.
翻译:多重资源(SEMR) 下的序列估算问题在联盟环境下被定义。 SEMR 可以被视为统计估计和土匪理论的交叉点。 在这个问题上, 代理商面临着用于估算一个参数$\theta$的 k 资源。 代理商应该通过明智地选择这些参数来不断了解资源的质量。 代理商应该根据所收集的数据在最后建议一个估算器。 在本文中, 我们假设资源的分配是高山的。 最后估计器的质量要根据其平均的平方错误来评估。 另外, 我们将我们的类估算器限于公正的估算器, 以便界定一个有意义的遗憾概念。 遗憾地是测量器的性能, 与最佳差异相比, 最后估计器的性能有差异。 我们提出一个更低的界限, 以便确定即使分布不是高斯扬的, 也确定环境的基本限制。 另外, 我们提出一个有条理的优化的算法, 以达到这个更低的界限。