This paper attempts to analyze the effectiveness of deep learning for tabular data processing. It is believed that decision trees and their ensembles is the leading method in this domain, and deep neural networks must be content with computer vision and so on. But the deep neural network is a framework for building gradient-based hierarchical representations, and this key feature should be able to provide the best processing of generic structured (tabular) data, not just image matrices and audio spectrograms. This problem is considered through the prism of the Weather Prediction track in the Yandex Shifts challenge (in other words, the Yandex Shifts Weather task). This task is a variant of the classical tabular data regression problem. It is also connected with another important problem: generalization and uncertainty in machine learning. This paper proposes an end-to-end algorithm for solving the problem of regression with uncertainty on tabular data, which is based on the combination of four ideas: 1) deep ensemble of self-normalizing neural networks, 2) regression as parameter estimation of the Gaussian target error distribution, 3) hierarchical multitask learning, and 4) simple data preprocessing. Three modifications of the proposed algorithm form the top-3 leaderboard of the Yandex Shifts Weather challenge respectively. This paper considers that this success has occurred due to the fundamental properties of the deep learning algorithm, and tries to prove this.
翻译:本文试图分析为表格数据处理而深层次学习的有效性。 据信, 决策树及其组合是该领域的主要方法, 深神经网络必须满足计算机的视觉等。 但深神经网络是建立基于梯度的等级表示的框架, 而这一关键特征应该能够提供最佳处理通用结构( 图表) 数据, 而不仅仅是图像矩阵和音频光谱。 这个问题通过Yandex Shifts 挑战( 换句话说, Yandex Shifts 气象任务) 的天气预报轨道的棱镜来考虑。 这是一项典型的表格数据回归问题的一个变体。 它也与另一个重要问题相关: 机器学习中的概括化和不确定性。 本文提出了一种端到端的算法, 以解决以表数据不确定性为基础的回归问题, 其基础是以下四个概念的组合:(1) 深度的自我正常化神经网络的集合,(2) 回归作为Gaussals 目标错误分布的参数估计, 3级多任务学习, 和 4) 简单的数据回归问题。 简单的数据前演算法的三次修改分别证明了YA-3 的深度演算, 最终的演算的演算, 最终的演算的演算的演算, 最终的演算的演算结果, 最终的演算结果的演算结果, 已经分别证明了了YA- 。