This paper develops new tools to quantify uncertainty in optimal decision making and to gain insight into which variables one should collect information about given the potential cost of measuring a large number of variables. We investigate simultaneous inference to determine if a group of variables is relevant for estimating an optimal decision rule in a high-dimensional semiparametric framework. The unknown link function permits flexible modeling of the interactions between the treatment and the covariates, but leads to nonconvex estimation in high dimension and imposes significant challenges for inference. We first establish that a local restricted strong convexity condition holds with high probability and that any feasible local sparse solution of the estimation problem can achieve the near-oracle estimation error bound. We further rigorously verify that a wild bootstrap procedure based on a debiased version of the local solution can provide asymptotically honest uniform inference for the effect of a group of variables on optimal decision making. The advantage of honest inference is that it does not require the initial estimator to achieve perfect model selection and does not require the zero and nonzero effects to be well-separated. We also propose an efficient algorithm for estimation. Our simulations suggest satisfactory performance. An example from a diabetes study illustrates the real application.
翻译:本文开发了新的工具,以量化最佳决策中的不确定性,并深入了解哪些变量应当收集信息,说明衡量大量变量的潜在成本。我们同时调查同时推断,以确定一组变量是否与估计高维半参数框架中最佳决策规则有关。未知联系功能允许对治疗与共变体之间的互动进行灵活的模型分析,但会导致高维度的不相容估计,并给推断带来重大挑战。我们首先确定,局部限制强的强烈共性条件存在的可能性很高,对估算问题的任何可行的本地稀释解决方案都有可能达到近乎骨骼估计错误的约束。我们进一步严格核实,基于本地解决方案的偏差版本的野靴陷阱程序能够提供对一组变量对最佳决策的影响的无限诚实的推断。诚实的推断的优点是,它并不要求初始估计者实现完美的模型选择,也不要求零和非零效应能够完全分离。我们还提议了一种高效的测算法用于估算。我们还提议了一种有效的测算方法。