Supervised machine learning approaches require the formulation of a loss functional to be minimized in the training phase. Sequential data are ubiquitous across many fields of research, and are often treated with Euclidean distance-based loss functions that were designed for tabular data. For smooth oscillatory data, those conventional approaches lack the ability to penalize amplitude, frequency and phase prediction errors at the same time, and tend to be biased towards amplitude errors. We introduce the surface similarity parameter (SSP) as a novel loss function that is especially useful for training machine learning models on smooth oscillatory sequences. Our extensive experiments on chaotic spatio-temporal dynamics systems indicate that the SSP is beneficial for shaping gradients, thereby accelerating the training process, reducing the final prediction error, increasing weight initialization robustness, and implementing a stronger regularization effect compared to using classical loss functions. The results indicate the potential of the novel loss metric particularly for highly complex and chaotic data, such as data stemming from the nonlinear two-dimensional Kuramoto-Sivashinsky equation and the linear propagation of dispersive surface gravity waves in fluids.
翻译:受监督的机器学习方法要求设计一个损失功能,以便在培训阶段尽量减少。 序列数据在许多研究领域无处不在,而且往往使用为表格数据设计的Euclide的远程损失功能处理。对于光滑的视觉数据来说,这些常规方法没有能力同时惩罚振幅、频率和阶段预测错误,而且往往偏向于振幅错误。我们引入了表面相似参数(SSP),作为一个新的损失函数,对光滑血管序列的机器学习模型特别有用。我们在混乱的时空动态动态动态系统上进行的广泛实验表明,SSP有利于形成梯度,从而加快培训过程,减少最终预测错误,提高重量初始性,并比使用经典损失功能实施更强有力的调整效应。结果显示,新的损失指标有可能特别用于高度复杂和混乱的数据,例如来自非线性二维的Kuramamoto-Sivashinsky等式数据以及流体中分散式地心重波的线性波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波波及波波波波波波波波波波波波波波波波波波波波波及波波波波波波及波及波波及波及波及波及波及波及波及波及波及波及波及波及波波及波及波及波及波及波及波及波及波及波及波及波及波及波及波及波波波及波及波及波及波及波及波及波及波及波及波波波波波波波及波及波及波及波及波及波波波及波及波及波及波及波及波及波及波及波及波波)的数据。