The median absolute deviation (MAD) is a popular robust measure of statistical dispersion. However, when it is applied to non-parametric distributions (especially multimodal, discrete, or heavy-tailed), lots of statistical inference issues arise. Even when it is applied to distributions with slight deviations from normality and these issues are not actual, the Gaussian efficiency of the MAD is only 37% which is not always enough. In this paper, we introduce the quantile absolute deviation (QAD) as a generalization of the MAD. This measure of dispersion provides a flexible approach to analyzing properties of non-parametric distributions. It also allows controlling the trade-off between robustness and statistical efficiency. We use the trimmed Harrell-Davis median estimator based on the highest density interval of the given width as a complimentary median estimator that gives increased finite-sample Gaussian efficiency compared to the sample median and a breakdown point matched to the QAD. As a rule of thumb, we suggest using two new measures of dispersion called the standard QAD and the optimal QAD. They give 54% and 65% of Gaussian efficiency having breakdown points of 32% and 14% respectively.
翻译:中位绝对偏差(MAD)是流行的统计分布的稳健度度量。然而,当对非参数分布(特别是多式联运、离散或重尾)应用时,会出现大量统计推论问题。即使将中位绝对偏差(MAD)应用于与正常度略有偏差的分布,而且这些问题并不实际,MAD的高斯效率仅为37%,这并不总是足够。在本文中,我们引入了四分绝对偏差(QAD)作为MAD的概括性。这种分散度量量度提供了分析非参数分布特性的灵活方法。它也允许控制稳健性和统计效率之间的权衡。我们使用基于给定宽度最高密度间隔的三角Harrell-Davis中位估量器作为补充中位估量器,该中位估量器与样本中位中位值相比提高了定点的效率,与QAD(QAD)相匹配。作为缩略图,我们建议使用两种新的分散度测量方法,即标准QAD和最高QAD(分别为54%和最高QAD)断点的54%。