This paper studies the quantization of heavy-tailed data in some fundamental statistical estimation problems, where the underlying distributions have bounded moments of some order. We propose to truncate and properly dither the data prior to a uniform quantization. Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the proposed scheme. In particular, concrete results are worked out for covariance estimation, compressed sensing, and matrix completion, all agreeing that the quantization only slightly worsens the multiplicative factor. Besides, we study compressed sensing where both covariate (i.e., sensing vector) and response are quantized. Under covariate quantization, although our recovery program is non-convex because the covariance matrix estimator lacks positive semi-definiteness, all local minimizers are proved to enjoy near optimal error bound. Moreover, by the concentration inequality of product process and covering argument, we establish near minimax uniform recovery guarantee for quantized compressed sensing with heavy-tailed noise.
翻译:本文研究一些基本统计估计问题中重尾数据的量化问题,其中基本分布点已经将某些时间相交。我们提议在统一量化之前对数据进行截断和适当抖动。我们的主要观点是,(近)微缩估计误差率只能从拟议办法产生的量化数据中实现。特别是,为共变估计、压缩感测和矩阵完成工作制定了具体结果,所有结果都一致认为,这种定量只略微恶化了多复制系数。此外,我们研究的是共变(即感测矢量)和反应都具有定量的压缩感测。在共变定量下,虽然我们的回收方案是非共变式的,因为共变矩阵估量器缺乏正半确定性,但所有地方最小化者都证明享有几乎最佳的误差。此外,由于产品过程的浓度不平等和涵盖的论点,我们为以重压缩噪音进行量化的压缩感测建立了近微成模的回收保证。