This paper addresses the deconvolution problem of estimating a square-integrable probability density from observations contaminated with additive measurement errors having a known density. The estimator begins with a density estimate of the contaminated observations and minimizes a reconstruction error penalized by an integrated squared $m$-th derivative. Theory for deconvolution has mainly focused on kernel- or wavelet-based techniques, but other methods including spline-based techniques and this smoothness-penalized estimator have been found to outperform kernel methods in simulation studies. This paper fills in some of these gaps by establishing asymptotic guarantees for the smoothness-penalized approach. Consistency is established in mean integrated squared error, and rates of convergence are derived for Gaussian, Cauchy, and Laplace error densities, attaining some lower bounds already in the literature. The assumptions are weak for most results; the estimator can be used with a broader class of error densities than the deconvoluting kernel. Our application example estimates the density of the mean cytotoxicity of certain bacterial isolates under random sampling; this mean cytotoxicity can only be measured experimentally with additive error, leading to the deconvolution problem. We also describe a method for approximating the solution by a cubic spline, which reduces to a quadratic program.
翻译:本文解决了一个问题,即从受到已知密度的加性测量误差污染的观测值中估计平方可积概率密度的反卷积问题。该估计器开始于污染观测值的密度估计,并最小化被积分的平方第 $m$ 次导数惩罚的重构误差。反卷积的理论主要集中在基于核或小波的技术上,但其他方法,包括基于样条的技术和这种平滑惩罚估计器,已被发现在模拟研究中优于核方法。本文填补了一些缺口,为平滑惩罚方法建立了渐近保证。在大多数结果中建立了均方误差一致收敛性,并推导出了高斯、柯西和拉普拉斯误差密度的收敛速度,可以达到一些已经在文献中提出的下限。对于大多数结果,假设都是较弱的;该估计器可以用于比解卷积内核更广泛的误差密度类别。我们的应用示例是在随机抽样下估计某些细菌株平均细胞毒性的密度;该平均细胞毒性只能通过加性误差进行实验测量,导致了解卷积问题。我们还描述了一种通过三次样条逼近解的方法,这归结为一个二次规划问题。