We consider the problem of approximating a function in general nonlinear subsets of $L^2$ when only a weighted Monte Carlo estimate of the $L^2$-norm can be computed. Of particular interest in this setting is the concept of sample complexity, the number of samples that are necessary to recover the best approximation. Bounds for this quantity have been derived in a previous work and depend primarily on the model class and are not influenced positively by the regularity of the sought function. This result however is only a worst-case bound and is not able to explain the remarkable performance of iterative hard thresholding algorithms that is observed in practice. We reexamine the results of the previous paper and derive a new bound that is able to utilize the regularity of the sought function. A critical analysis of our results allows us to derive a sample efficient algorithm for the model set of low-rank tensors. The viability of this algorithm is demonstrated by recovering quantities of interest for a classical high-dimensional random partial differential equation.
翻译:我们认为,当只能计算蒙特卡洛对美元-诺尔的加权估计值时,在一般的非线性子子集中约准2美元($L%2美元)的函数的问题。在这一背景下,特别感兴趣的是样本的复杂性概念,即恢复最佳近似所需的样本数量。这一数量在以前的工作中已经得出,主要取决于模型类别,并且没有受到所寻求功能的规律性的积极影响。然而,这一结果只是一个最坏的情况,无法解释在实践中所观察到的迭接硬阈值算法的显著性能。我们重新审查了前一份文件的结果,并提出了能够利用所寻求函数的规律性的新约束。对结果进行严格分析后,我们得以为低级电压模型集得出一个高效的样本算法。这一算法的可行性表现在为对传统的高度随机偏差等式回收利息的数量上。