We theoretically investigate the typical learning performance of $\ell_{1}$-regularized linear regression ($\ell_1$-LinR) for Ising model selection using the replica method from statistical mechanics. For typical random regular (RR) graphs in the paramagnetic phase, an accurate estimate of the typical sample complexity of $\ell_1$-LinR is obtained, demonstrating that, for an Ising model with $N$ variables, $\ell_1$-LinR is model selection consistent with $M=\mathcal{O}\left(\log N\right)$ samples. Moreover, we provide a computationally efficient method to accurately predict the non-asymptotic behavior of $\ell_1$-LinR for moderate $M$ and $N$, such as the precision and recall rates. Simulations show a fairly good agreement between the theoretical predictions and experimental results, even for graphs with many loops, which supports our findings. Although this paper focuses on $\ell_1$-LinR, our method is readily applicable for precisely investigating the typical learning performances of a wide class of $\ell_{1}$-regularized M-estimators including $\ell_{1}$-regularized logistic regression and interaction screening.
翻译:我们从理论上对使用统计力学复制方法进行模型选择的典型学习性能进行了典型的美元=1美元=1美元=1美元=LinR)的常规线性回归(ell_1美元-LinR)的典型学习性能进行了调查。对于在抛磁阶段的典型随机常规(RR)图,我们获得了对典型样本复杂性的准确估计,即$=1美元-LinR的典型学习性能的精确估计,这表明,对于具有N美元变量的Ising模型来说,$_1美元-LinR是符合美元=mámcal{O ⁇ left(log n\right)样本的模型选择的。此外,我们提供了一种计算高效的方法,精确地预测中值$_1美元-LinR(RR)的非被动行为,例如精确度和回溯率。模拟表明,理论预测和实验结果之间相当一致,即使是具有多种循环的图表,也支持我们的调查结果。虽然本文侧重于$_1美元-LinRinR(Olef),但我们的方法很容易适用于精确地调查典型的正常学习性分析1美元1美元=基础分析1美元=1美元=1美元=Mslassimalislationalislationalislationalislationalxxxxxxxxxxxxxxxxlxxxxxxxxxxxxxxxxxxxx。