Dyadic data is often encountered when quantities of interest are associated with the edges of a network. As such it plays an important role in statistics, econometrics and many other data science disciplines. We consider the problem of uniformly estimating a dyadic Lebesgue density function, focusing on nonparametric kernel-based estimators taking the form of dyadic empirical processes. Our main contributions include the minimax-optimal uniform convergence rate of the dyadic kernel density estimator, along with strong approximation results for the associated standardized and Studentized $t$-processes. A consistent variance estimator enables the construction of valid and feasible uniform confidence bands for the unknown density function. We showcase the broad applicability of our results by developing novel counterfactual density estimation and inference methodology for dyadic data, which can be used for causal inference and program evaluation. A crucial feature of dyadic distributions is that they may be "degenerate" at certain points in the support of the data, a property making our analysis somewhat delicate. Nonetheless our methods for uniform inference remain robust to the potential presence of such points. For implementation purposes, we discuss inference procedures based on positive semi-definite covariance estimators, mean squared error optimal bandwidth selectors and robust bias correction techniques. We illustrate the empirical finite-sample performance of our methods both in simulations and with real-world trade data, for which we make comparisons between observed and counterfactual trade distributions in different years. Our technical results concerning strong approximations and maximal inequalities are of potential independent interest.
翻译:当兴趣量与网络边缘相关时,往往会遇到Dyadi数据。因此,它在统计、计量经济学和许多其他数据科学学科中起着重要作用。我们认为统一估计dyadic Lebesgue密度函数的问题,重点是非对数内核测深器,其形式为dyadic 经验过程。我们的主要贡献包括dyadic内核密度测深器的微模-最佳统一趋同率率,以及相关的标准化和学生化美元过程的强烈近似结果。一个一致的差异估测器使得能够为未知密度函数构建有效和可行的统一信任带。我们通过开发新的反数内核密度估计和dyadic数据推导法展示我们结果的广泛适用性,这些数据可用于因果性判断和程式评价。 dyadic分布的一个关键特征是,它们可能是在数据支持的某些点“退化者”,一种使我们的分析变得相当精准。尽管我们的判断力测测测测深度方法使我们的精确度、精确度的精确度程序得以实现。