Dyadic data is often encountered when quantities of interest are associated with the edges of a network. As such it plays an important role in statistics, econometrics and many other data science disciplines. We consider the problem of uniformly estimating a dyadic Lebesgue density function, focusing on nonparametric kernel-based estimators which take the form of U-process-like dyadic empirical processes. We provide uniform point estimation and distributional results for the dyadic kernel density estimator, giving valid and feasible procedures for robust uniform inference. Our main contributions include the minimax-optimal uniform convergence rate of the dyadic kernel density estimator, along with strong approximation results for the associated standardized $t$-process. A consistent variance estimator is introduced in order to obtain analogous results for the Studentized $t$-process, enabling the construction of provably valid and feasible uniform confidence bands for the unknown density function. A crucial feature of U-process-like dyadic empirical processes is that they may be "degenerate" at some or possibly all points in the support of the data, a property making our uniform analysis somewhat delicate. Nonetheless we show formally that our proposed methods for uniform inference remain robust to the potential presence of such unknown degenerate points. For the purpose of implementation, we discuss uniform inference procedures based on positive semi-definite covariance estimators, mean squared error optimal bandwidth selectors and robust bias-correction methods. We illustrate the empirical finite-sample performance of our robust inference methods in a simulation study. Our technical results concerning strong approximations and maximal inequalities are of potential independent interest.
翻译:当兴趣量与网络边缘相关时,往往会遇到Dyadi数据。因此,它在统计、计量经济学和许多其他数据科学学科中起着重要作用。我们考虑了统一估算dyadi Lebesgue密度函数的问题,重点是以U-process-syadic dyadic 实验过程为形式的非参数内核测算器。我们为 dyadic 内核密度测算仪提供了统一点估计和分布结果,为稳健的判断提供了有效且可行的程序。我们的主要贡献包括:dyadic 内核密度测算和许多其他数据科学学科的最小和最佳统一趋同率,同时考虑相关标准化美元过程的强烈近似近似结果。我们采用了一致的差异估测仪,以获得类似结果,从而为未知的密度功能提供了可辨别有效且可行的统一技术信任带。U-procal-liverdic演算过程的一个关键特征是,它们可能“在最优的精确的精确度上”,在某种或可能情况下,我们正态的精确的精确性结果分析中,我们提出的数据结果仍然支持。