Binned scatter plots, or binscatters, have become a popular and convenient tool in applied microeconomics for visualizing bivariate relations and conducting informal specification testing. However, a binscatter, on its own, is very limited in what it can characterize about the conditional mean. We introduce a suite of formal and visualization tools based on binned scatter plots to restore, and in some dimensions surpass, the visualization benefits of the classical scatter plot. We deliver a comprehensive toolkit for applications, including estimation of conditional mean and quantile functions, visualization of variance and precise quantification of uncertainty, and formal tests of substantive hypotheses such as linearity or monotonicity, and an extension to testing differences across groups. To do so we give an extensive theoretical analysis of binscatter and related partition-based methods, accommodating nonlinear and potentially nonsmooth models, which allows us to treat binary, count, and other discrete outcomes as well. We also correct a methodological mistake related to covariate adjustment present in prior implementations, which yields an incorrect shape and support of the conditional mean. All of our results are implemented in publicly available software, and showcased with three substantive empirical illustrations. Our empirical results are dramatically different when compared to those obtained using the prevalent methods in the literature.
翻译:在应用微观经济学中,捆绑散落地块,即垃圾散落地块,已成为一种流行和方便的工具,用于可视化双轨关系和进行非正式的规格测试。然而,垃圾箱本身在条件平均值的特征方面非常有限。我们推出一套基于垃圾散落地的正规和可视化工具,以恢复并在某些方面超过传统散落地块的可视化效益。我们提供了一套综合应用工具,包括估计有条件的中值和量函数、可视化差异和精确量化不确定性,以及正式测试实质性假设,如线性或单质性,以及扩大对不同群体差异的测试。为了做到这一点,我们广泛从理论角度分析了垃圾散散落和相关分割法,容纳非线性和潜在非线性模型,从而使我们能够处理典型散散落地块的二进化、计数和其他离散结果。我们还纠正了先前实施过程中出现的与混杂调整有关的方法错误,这些调整产生了不正确的形状和支持有条件平均值。我们的所有结果都是在可公开获取的软件和实验性模型中实施的。