We consider the nonparametric estimation of an S-shaped regression function. The least squares estimator provides a very natural, tuning-free approach, but results in a non-convex optimisation problem, since the inflection point is unknown. We show that the estimator may nevertheless be regarded as a projection onto a finite union of convex cones, which allows us to propose a mixed primal-dual bases algorithm for its efficient, sequential computation. After developing a projection framework that demonstrates the consistency and robustness to misspecification of the estimator, our main theoretical results provide sharp oracle inequalities that yield worst-case and adaptive risk bounds for the estimation of the regression function, as well as a rate of convergence for the estimation of the inflection point. These results reveal not only that the estimator achieves the minimax optimal rate of convergence for both the estimation of the regression function and its inflection point (up to a logarithmic factor in the latter case), but also that it is able to achieve an almost-parametric rate when the true regression function is piecewise affine with not too many affine pieces. Simulations and a real data application to air pollution modelling also confirm the desirable finite-sample properties of the estimator, and our algorithm is implemented in the R package Sshaped.
翻译:我们认为,S形回归函数的非参数估算值是非参数性的。 最小正方位估计值提供了非常自然的、无调整的优化方法,但导致非曲线优化问题,因为偏差点未知。 我们表明,偏差点仍可被视为对固定的 convex 锥体结合的预测,这使我们能够提出一种混合的原始和双重基础算法,以有效、有序地计算。 在开发一个预测框架,以显示估计器的精确度和稳健性,我们的主要理论结果提供了尖锐的或极小的不平等,在估算回归函数时产生最坏的情况和适应性的风险界限,以及估计偏差点的趋同率。 这些结果不仅表明,估计器在估算回归函数及其偏差点方面实现了最小的最佳趋同率,(在后一种情况中达到对等系数),而且当真正的回归和精确的算法功能与精确的精确的空气模型也确认了我们真正的回归和精确的模型性数据,它也能够达到几乎相当的精确度率。