We propose a type of generalised additive models with of model components based on pair-copula constructions, with prediction as a main aim. The model components are designed such that our model may capture potentially complex interaction effects in the relationship between the response covariates. In addition, our model does not require discretisation of continuous covariates, and is therefore suitable for problems with many such covariates. Further, we have designed a fitting algorithm inspired by gradient boosting, as well as efficient procedures for model selection and evaluation of the model components, through constraints on the model space and approximations, that speed up time-costly computations. In addition to being absolutely necessary for our model to be a realistic alternative in higher dimensions, these techniques may also be useful as a basis for designing efficient models selection algorithms for other types of copula regression models. We have explored the characteristics of our method in a simulation study, in particular comparing it to natural alternatives, such as logic regression, classic boosting models and penalised logistic regression. We have also illustrated our approach on the Wisconsin breast cancer dataset and on the Boston housing dataset. The results show that our method has a prediction performance that is either better than or comparable to the other methods, even when the proportion of discrete covariates is high.
翻译:我们建议了一种通用的添加模型,其中含有基于造对面板的建构的模型元件,预测是主要目标。模型元件的设计使我们的模型能够捕捉到反应共变体之间的关系中潜在的复杂互动效应。此外,我们的模型不需要连续的共变体分解,因此适合许多此类共变体的问题。此外,我们设计了一种适当的算法,这种算法是由梯度推升以及模型元件的模型选择和评估的高效程序所启发的,这种模型空间和近似受到制约,加速了时间成本计算的速度。我们的模型除了绝对有必要成为更高层面的现实替代物外,这些技术还可能有用,作为设计其他类型相交回归模型的有效模型选择算法的基础。我们在模拟研究中探讨了我们的方法的特点,特别是将其与自然替代法,例如逻辑回归、经典提振模型和惩罚性物流回归法等进行比较。我们还介绍了我们在威斯康辛乳腺癌数据集和波士顿住房数据集方面的做法。结果显示,我们的方法的离异性率比其他方法要好。