We consider the framework of penalized estimation where the penalty term is given by a real-valued polyhedral gauge, which encompasses methods such as LASSO, generalized LASSO, SLOPE, OSCAR, PACS and others. Each of these estimators is defined through an optimization problem and can uncover a different structure or ``pattern'' of the unknown parameter vector. We define a novel and general notion of patterns based on subdifferentials and formalize an approach to measure pattern complexity. For pattern recovery, we provide a minimal condition for a particular pattern to be detected by the procedure with positive probability, the so-called accessibility condition. Using our approach, we also introduce the stronger noiseless recovery condition. For the LASSO, it is well known that the irrepresentability condition is necessary for pattern recovery with probability larger than $1/2$ and we show that the noiseless recovery plays exactly the same role in our general framework, thereby unifying and extending the irrepresentability condition to a broad class of penalized estimators. We also show that the noiseless recovery condition can be relaxed when turning to so-called thresholded penalized estimators: we prove that the necessary condition of accessibility is already sufficient for sure pattern recovery by thresholded penalized estimation provided that the noise is small enough. Throughout the article, we demonstrate how our findings can be interpreted through a geometrical lens.
翻译:我们考虑惩罚估计框架,其中惩罚项由实值多面体范数给出,这涵盖了LASSO、广义LASSO、SLOPE、OSCAR、PACS等方法。这些估计器均通过优化问题定义,能够揭示未知参数向量的不同结构或"模式"。我们基于次微分定义了一种新颖且通用的模式概念,并形式化了衡量模式复杂度的途径。针对模式恢复,我们为特定模式以正概率被检测到(即可达性条件)提供了最小条件。利用该方法,我们还引入了更强的无噪声恢复条件。对于LASSO,众所周知不可表示性条件是以大于$1/2$概率实现模式恢复的必要条件,而我们证明无噪声恢复条件在通用框架中具有完全相同的功能,从而将不可表示性条件统一并扩展至广泛的惩罚估计器类别。我们还证明,当转向阈值惩罚估计器时,无噪声恢复条件可被放宽:我们证明只要噪声足够小,可达性这一必要条件对于阈值惩罚估计实现确定模式恢复已是充分条件。全文通过几何视角阐释了研究结果的深层含义。