Doubly truncated data arise in many areas such as astronomy, econometrics, and medical studies. For the regression analysis with doubly truncated response variables, the existence of double truncation may bring bias for estimation as well as affect variable selection. We propose a simultaneous estimation and variable selection procedure for the doubly truncated regression, allowing a diverging number of regression parameters. To remove the bias introduced by the double truncation, a Mann-Whitney-type loss function is used. The adaptive LASSO penalty is then added into the loss function to achieve simultaneous estimation and variable selection. An iterative algorithm is designed to optimize the resulting objective function. We establish the consistency and the asymptotic normality of the proposed estimator. The oracle property of the proposed selection procedure is also obtained. Some simulation studies are conducted to show the finite sample performance of the proposed approach. We also apply the method to analyze a real astronomical data.
翻译:在许多领域,例如天文学、计量经济学和医学研究,都会出现多断层的数据。对于使用双断层响应变量的回归分析,存在双断层截断可能会给估计带来偏差,并影响变量的选择。我们建议对双断层回归同时进行估算和可变选择程序,允许不同数量的回归参数。为了消除双断层截断带来的偏差,将使用曼-惠特尼型损失函数。然后在损失函数中添加适应性LASSO处罚,以同时进行估算和变量选择。迭接算法旨在优化由此产生的目标功能。我们建立拟议估算符的一致性和无损常性常态。还获得了拟议选择程序的奥甲特性。进行一些模拟研究以显示拟议方法的有限样本性能。我们还采用这种方法分析真实的天文数据。