Model selection, via penalized likelihood type criteria, is a standard task in many statistical inference and machine learning problems. Progress has led to deriving criteria with asymptotic consistency results and an increasing emphasis on introducing non-asymptotic criteria. We focus on the problem of modeling non-linear relationships in regression data with potential hidden graph-structured interactions between the high-dimensional predictors, within the mixture of experts modeling framework. In order to deal with such a complex situation, we investigate a block-diagonal localized mixture of polynomial experts (BLoMPE) regression model, which is constructed upon an inverse regression and block-diagonal structures of the Gaussian expert covariance matrices. We introduce a penalized maximum likelihood selection criterion to estimate the unknown conditional density of the regression model. This model selection criterion allows us to handle the challenging problem of inferring the number of mixture components, the degree of polynomial mean functions, and the hidden block-diagonal structures of the covariance matrices, which reduces the number of parameters to be estimated and leads to a trade-off between complexity and sparsity in the model. In particular, we provide a strong theoretical guarantee: a finite-sample oracle inequality satisfied by the penalized maximum likelihood estimator with a Jensen-Kullback-Leibler type loss, to support the introduced non-asymptotic model selection criterion. The penalty shape of this criterion depends on the complexity of the considered random subcollection of BLoMPE models, including the relevant graph structures, the degree of polynomial mean functions, and the number of mixture components.
翻译:模型选择,通过受抑制的可能性类型标准,是许多统计性推断和机器学习问题中的一项标准任务。进展导致以无症状一致性结果和日益强调采用非痛苦性标准来得出标准。我们注重在回归数据中模拟非线性关系的问题,在高维预测器之间,在专家建模的混合框架内进行潜在的隐藏的图形结构互动。为了处理这种复杂情况,我们调查了多尼米亚专家(BLOMPE)回归模型的块-直径局部混合物。该模型是在高斯专家常态矩阵的反回归和块形直径结构结构结构基础上建立的。我们采用了一个受限的最大可能性选择标准,以估计回归模型的未知条件密度。这个模型选择标准使我们能够处理一个具有挑战性的问题,即判断混合物成分的数量,多尼米平均功能的程度,以及聚合变异矩阵的隐藏的块-直线性结构,它减少了估计的参数数量,并导致高尼基专家专家专家常量矩阵的反回归性结构。我们引入了一个受限的最大可能性选择标准,其中包括高尼基标准中的极性标准。