对CVaR的比亚修正的峰值 -- -- 超过临界临界值估计 (Bias-Corrected Peaks-Over-Threshold Estimation of the CVaR)

The conditional value-at-risk (CVaR) is a useful risk measure in fields such as machine learning, finance, insurance, energy, etc. When measuring very extreme risk, the commonly used CVaR estimation method of sample averaging does not work well due to limited data above the value-at-risk (VaR), the quantile corresponding to the CVaR level. To mitigate this problem, the CVaR can be estimated by extrapolating above a lower threshold than the VaR using a generalized Pareto distribution (GPD), which is often referred to as the peaks-over-threshold (POT) approach. This method often requires a very high threshold to fit well, leading to high variance in estimation, and can induce significant bias if the threshold is chosen too low. In this paper, we derive a new expression for the GPD approximation error of the CVaR, a bias term induced by the choice of threshold, as well as a bias correction method for the estimated GPD parameters. This leads to the derivation of a new estimator for the CVaR that we prove to be asymptotically unbiased. In a practical setting, we show through experiments that our estimator provides a significant performance improvement compared with competing CVaR estimators in finite samples. As a consequence of our bias correction method, it is also shown that a much lower threshold can be selected without introducing significant bias. This allows a larger portion of data to be be used in CVaR estimation compared with the typical POT approach, leading to more stable estimates. As secondary results, a new estimator for a second-order parameter of heavy-tailed distributions is derived, as well as a confidence interval for the CVaR which enables quantifying the level of variability in our estimator.

翻译：有条件风险值( CVaR) 是机器学习、金融、保险、能源等领域的一个有用的风险计量标准。在测量极端风险时,通常使用的 CVAR 样本平均估计方法由于数值高于风险值( VaR) 的有限数据( 与 CVaR 水平相对的量化), 效果不佳。为缓解这一问题, CVaR 可以通过使用通用的 Pareto 偏差分布法( GPD) 推算低于 VAR 的下限来估计。这常常被称作超超临界值( POT) 方法。在测量非常极端风险时, 常用的 CVaR 平均估计方法通常要求非常高的阈值, 导致高的估算差异, 如果选择的阈值太低, 则会引起显著的偏差。在本文中, CVaR 选择的偏差术语, 以及 GPDD 估计值的偏差修正方法。这可以导致为 CVR 的新的测算法, 使得我们通过直观的直观的直径直径的直径分析器, 显示我们的直径直径的C。