Many probabilistic models that have an intractable normalizing constant may be extended to contain covariates. Since the evaluation of the exact likelihood is difficult or even impossible for these models, score matching was proposed to avoid explicit computation of the normalizing constant. In the literature, score matching has so far only been developed for models in which the observations are independent and identically distributed (IID). However, the IID assumption does not hold in the traditional fixed design setting for regression-type models. To deal with the estimation of these covariate-dependent models, this paper presents a new score matching approach for independent but not necessarily identically distributed data under a general framework for both continuous and discrete responses, which includes a novel generalized score matching method for count response regression. We prove that our proposed score matching estimators are consistent and asymptotically normal under mild regularity conditions. The theoretical results are supported by simulation studies and a real-data example. Additionally, our simulation results indicate that, compared to approximate maximum likelihood estimation, the generalized score matching produces estimates with substantially smaller biases in an application to doctoral publication data.
翻译:由于这些模型很难或甚至不可能对确切可能性作出评估,因此建议进行得分比对,以避免对正常常数进行明确的计算。在文献中,迄今只为观测独立且分布相同的模型(IID)制定了得分比对。然而,IID假设在传统的回归型模型固定设计设置中并不具备。为了估算这些依赖共变模式的模型,本文件为连续和离散反应的总框架下独立但不一定相同分布的数据提供了一种新的得分比对法方法,其中包括一种新的通用得分比对法方法,用于计数回归。我们证明,我们提议的得分比对算符在温和正常条件下是一致的。理论结果得到模拟研究和真实数据实例的支持。此外,我们的模拟结果表明,与估计的近似最大可能性相比,通用得分比对博士出版物数据应用中的估计产生少得多的偏差。