Several novel statistical methods have been developed to estimate large integrated volatility matrices based on high-frequency financial data. To investigate their asymptotic behaviors, they require a sub-Gaussian or finite high-order moment assumption for observed log-returns, which cannot account for the heavy-tail phenomenon of stock returns. Recently, a robust estimator was developed to handle heavy-tailed distributions with some bounded fourth-moment assumption. However, we often observe that log-returns have heavier tail distribution than the finite fourth-moment and that the degrees of heaviness of tails are heterogeneous across asset and over time. In this paper, to deal with the heterogeneous heavy-tailed distributions, we develop an adaptive robust integrated volatility estimator that employs pre-averaging and truncation schemes based on jump-diffusion processes. We call this an adaptive robust pre-averaging realized volatility (ARP) estimator. We show that the ARP estimator has a sub-Weibull tail concentration with only finite 2$\alpha$-th moments for any $\alpha>1$. In addition, we establish matching upper and lower bounds to show that the ARP estimation procedure is optimal. To estimate large integrated volatility matrices using the approximate factor model, the ARP estimator is further regularized using the principal orthogonal complement thresholding (POET) method. The numerical study is conducted to check the finite sample performance of the ARP estimator.
翻译:多种新颖的统计方法已被开发出来,用于基于高频金融数据估计大的积分波动矩阵。为了研究它们的渐近行为,需要一个子高斯或有限高阶矩的假设。然而,这不能解释股票回报的重尾现象。最近,开发了一个健壮的估计量,以处理有一定有界第四矩的重尾分布。然而,我们经常观察到对数回报具有比有限的第四矩更重的分布,并且尾部的重量程度在资产和时间上是异质的。为了处理异质的重尾分布,我们开发了一种自适应健壮的积分波动率估计器,该估计器使用基于跳动扩散过程的预平均和截断方案。我们将其称为自适应健壮的预平均实现波动率(ARP)估计器。我们表明,ARP估计器具有子Weibull尾部集中,对于任意$\alpha>1$仅具有有限的$2\alpha$-th矩。此外,我们建立了配对的上下界,以表明ARP估计程序是最优的。为了使用近似因子模型估计大的积分波动矩阵,还利用主正交补阈值(POET)方法对ARP估计器进行了规则化。进行数字研究以检查ARP估计器的有限样本性能。