Model selection via penalized likelihood type criteria is a standard task in many statistical inference and machine learning problems. It has led to deriving criteria with asymptotic consistency results and an increasing emphasis on introducing non-asymptotic criteria. We focus on the problem of modeling non-linear relationships in regression data with potential hidden graph-structured interactions between the high-dimensional predictors, within the mixture of experts modeling framework. In order to deal with such a complex situation, we investigate a block-diagonal localized mixture of polynomial experts (BLoMPE) regression model, which is constructed upon an inverse regression and block-diagonal structures of the Gaussian expert covariance matrices. We introduce a penalized maximum likelihood selection criterion to estimate the unknown conditional density of the regression model. This model selection criterion allows us to handle the challenging problem of inferring the number of mixture components, the degree of polynomial mean functions, and the hidden block-diagonal structures of the covariance matrices, which reduces the number of parameters to be estimated and leads to a trade-off between complexity and sparsity in the model. In particular, we provide a strong theoretical guarantee: a finite-sample oracle inequality satisfied by the penalized maximum likelihood estimator with a Jensen-Kullback-Leibler type loss, to support the introduced non-asymptotic model selection criterion. The penalty shape of this criterion depends on the complexity of the considered random subcollection of BLoMPE models, including the relevant graph structures, the degree of polynomial mean functions, and the number of mixture components.
翻译:通过不确定可能性类型选择模型是许多统计推论和机器学习问题的一个标准任务。它导致以无症状一致性结果和日益强调引入非痛苦性标准来得出标准。我们侧重于在回归数据中模拟非线性关系的问题,在高维预测器之间,在专家建模的混合框架内,在高维预测器之间可能隐藏的图形结构互动。为了处理这种复杂的情况,我们调查了由多价货币专家(BLOMPE)的组合-直径局部组合回归模型(BLOMPE)的回归模型,该模型以高斯专家常态矩阵的反回归和块对角结构结构为基础。我们引入了一种受限制的最大可能性选择标准,以估计回归模型的未知条件密度。这个模型选择标准使我们能够处理一个具有挑战性的问题,即判断混合物组成部分的数量,多价函数的程度,以及混合值矩阵的隐藏的块性直径对立结构,该模型的相关参数数量会减少,并导致高点专家常态常态差异性结构之间的非贸易性结构。我们提供了一种严格的精度标准,其中包括B级模型的精度的精度或卡度标准的精度。