High-dimensional data can often display heterogeneity due to heteroscedastic variance or inhomogeneous covariate effects. Penalized quantile and expectile regression methods offer useful tools to detect heteroscedasticity in high-dimensional data. The former is computationally challenging due to the non-smooth nature of the check loss, and the latter is sensitive to heavy-tailed error distributions. In this paper, we propose and study (penalized) robust expectile regression (retire), with a focus on iteratively reweighted $\ell_1$-penalization which reduces the estimation bias from $\ell_1$-penalization and leads to oracle properties. Theoretically, we establish the statistical properties of the retire estimator under two regimes: (i) low-dimensional regime in which $d \ll n$; (ii) high-dimensional regime in which $s\ll n\ll d$ with $s$ denoting the number of significant predictors. In the high-dimensional setting, we carefully characterize the solution path of the iteratively reweighted $\ell_1$-penalized retire estimation, adapted from the local linear approximation algorithm for folded-concave regularization. Under a mild minimum signal strength condition, we show that after as many as $\log(\log d)$ iterations the final iterate enjoys the oracle convergence rate. At each iteration, the weighted $\ell_1$-penalized convex program can be efficiently solved by a semismooth Newton coordinate descent algorithm. Numerical studies demonstrate the competitive performance of the proposed procedure compared with either non-robust or quantile regression based alternatives.
翻译:高维数据往往会显示由于异性差异或异性共变效应造成的异质性能。 惩罚性量化和预期回归方法为检测高维数据的异性性性能提供了有用的工具。 前者由于检查损失的非移动性质而具有计算上的挑战性, 而后者则对重度误差分布十分敏感。 在本文中, 我们提议并研究( 惩罚性) 强性预期回归( 重现性), 重点是迭代再加权 $\ ell_ 1美元 和预期性回溯性作用。 惩罚性二次回归方法为检测高维数据的偏差提供了有用的工具。 从理论上讲, 我们根据两种制度, 确定退休估计者的统计特性:(一) 低度制度, 美元=ll n 美元;(二) 高度机制, $slexlevelile n\llillional recolational recalityalalational orality adviewal labilation, 我们仔细测量了一次当地货币正值变现的货币正统性变压性变压性进程。