Data integration has become increasingly popular owing to the availability of multiple data sources. This study considered quantile regression estimation when a key covariate had multiple proxies across several datasets. In a unified estimation procedure, the proposed method incorporates multiple proxies that have various relationships with the unobserved covariates. The proposed approach allows the inference of both the quantile function and unobserved covariates. Moreover, it does not require the quantile function's linearity and, simultaneously, accommodates both the linear and nonlinear proxies. Simulation studies have demonstrated that this methodology successfully integrates multiple proxies and revealed quantile relationships for a wide range of nonlinear data. The proposed method is applied to administrative data obtained from the Survey of Household Finances and Living Conditions provided by Statistics Korea, to specify the relationship between assets and salary income in the presence of multiple income records.
翻译:由于存在多种数据来源,数据整合越来越受欢迎。本研究在关键共变体在若干数据集中存在多个代理时,考虑了四分位回归估计。在统一的估计程序中,拟议方法包含与未观测的共变体有不同关系的多个代理体。拟议方法允许对四分位函数和未观测的共变体进行推断。此外,该方法不要求四分位函数的线性,同时包括线性和非线性代理体。模拟研究表明,这一方法成功地结合了多个代理体,并揭示了范围广泛的非线性数据中的四分位关系。拟议方法适用于韩国统计局提供的家庭财务和生活条件调查的行政数据,以具体说明存在多种收入记录的资产和工资收入之间的关系。