The model selection procedure is usually a single-criterion decision making in which we select the model that maximizes a specific metric in a specific set, such as the Validation set performance. We claim this is very naive and can perform poor selections of over-fitted models due to the over-searching phenomenon, which over-estimates the performance on that specific set. Futhermore, real world data contains noise that should not be ignored by the model selection procedure and must be taken into account when performing model selection. Also, we have defined four theoretical optimality conditions that we can pursue to better select the models and analyze them by using a multi-criteria decision-making algorithm (TOPSIS) that considers proxies to the optimality conditions to select reasonable models.
翻译:模式选择程序通常是单标准决策,我们选择的模型在具体集中最大限度地使用具体计量标准,例如验证集的性能。我们称,这非常天真,而且由于过度搜索现象,高估了该特定集的性能,因此对超合适模型的选择不力。更长远地说,真实的世界数据含有噪音,不应被模式选择程序所忽视,在进行模型选择时必须加以考虑。此外,我们还确定了四个理论最佳性条件,我们可以通过采用多标准决策算法(TOPSIS),考虑选择合理模型的最佳性条件,更好地选择模型并分析模型。