In this paper, we are interested in nonparametric kernel estimation of a generalized regression function, including conditional cumulative distribution and conditional quantile functions, based on an incomplete sample $(X_t, Y_t, \zeta_t)_{t\in \mathbb{ R}^+}$ copies of a continuous-time stationary ergodic process $(X, Y, \zeta)$. The predictor $X$ is valued in some infinite-dimensional space, whereas the real-valued process $Y$ is observed when $\zeta= 1$ and missing whenever $\zeta = 0$. Pointwise and uniform consistency (with rates) of these estimators as well as a central limit theorem are established. Conditional bias and asymptotic quadratic error are also provided. Asymptotic and bootstrap-based confidence intervals for the generalized regression function are also discussed. A first simulation study is performed to compare the discrete-time to the continuous-time estimations. A second simulation is also conducted to discuss the selection of the optimal sampling mesh in the continuous-time case. Finally, it is worth noting that our results are stated under ergodic assumption without assuming any classical mixing conditions.
翻译:在本文中,我们有兴趣根据不完整的样本$(X_t, Y_t, Y_t,\zeta_t)\\\\t\\\ in\mathb{R ⁇ {R ⁇ }R ⁇ $,一个连续时间固定的静态弧工艺的复制件$(X, Y,\zeta),对普遍回归功能进行非参数内核估计,包括有条件的累积分布和有条件的量子函数,以不完整的样本$(X_t, Y_t, Y_t,\zeta_t,\ t\ t)\\\ t\\\\\ t)\\\\ \\\\\\\\\\ $为基础,对普遍回归功能的预测值X美元进行非参数性内内核估计,而当美元=1美元时,实际价值为Y$(Y$)即为美元,而当$Zetata=0美元时,就会观察到实际价值为1美元;这些估计值的计算出这些估计值为1美元时,这些估计值为1美元,这些估计值为1美元。进行第二次模拟是为了讨论最佳采样的最佳采样的假设,最后假定是在不作任何连续的假设。