When training the parameters of a linear dynamical model, the gradient descent algorithm is likely to fail to converge if the squared-error loss is used as the training loss function. Restricting the parameter space to a smaller subset and running gradient descent within that subset can allow learning stable dynamical systems, but this strategy does not work for unstable systems. In this work, we examine the dynamics of the gradient descent algorithm and pinpoint what causes the difficulty of learning unstable systems. We show that observations taken at different times from the system to be learned influence the dynamics of the gradient descent algorithm to substantially different degrees. We introduce a time-weighted logarithmic loss function to correct this imbalance and demonstrate its effectiveness in learning unstable systems.
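To make the contrast concrete, below is a minimal sketch in Python of the phenomenon the abstract describes, under illustrative assumptions: a scalar system x_{t+1} = a*x_t with |a| > 1, a time weighting w_t = rho^(-2t) for an assumed known growth bound rho, and a per-sample loss log(1 + residual^2). The exact weighting and loss form in the paper may differ; the constants a_true, rho, and the learning rates are hypothetical choices for the demonstration, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate a scalar unstable linear system x_{t+1} = a_true * x_t + noise.
a_true = 1.2                     # unstable: |a_true| > 1
T = 30
x = np.empty(T + 1)
x[0] = 1.0
for t in range(T):
    x[t + 1] = a_true * x[t] + 0.01 * rng.standard_normal()

def grad_squared(a):
    # Gradient of the squared-error loss (1/T) * sum_t (a*x_t - x_{t+1})^2.
    # Late residuals are scaled by x_t ~ a_true^t, so the loss curvature
    # grows exponentially with the horizon T: later observations dominate.
    r = a * x[:-1] - x[1:]
    return np.mean(2.0 * r * x[:-1])

def grad_logweighted(a, rho=1.25):
    # Gradient of an illustrative time-weighted logarithmic loss
    #   (1/T) * sum_t w_t * log(1 + (a*x_t - x_{t+1})^2),  w_t = rho^(-2t),
    # where rho is an assumed upper bound on the system's growth rate.
    # The weights offset the exponential growth of x_t^2, and the log keeps
    # every sample's gradient contribution bounded.
    r = a * x[:-1] - x[1:]
    w = rho ** (-2.0 * np.arange(T))
    return np.mean(w * 2.0 * r * x[:-1] / (1.0 + r ** 2))

def fit(grad, lr, steps=5000, a0=0.5):
    a = a0
    for _ in range(steps):
        a -= lr * grad(a)
        if not np.isfinite(a) or abs(a) > 1e6:
            return float("nan")  # gradient descent diverged
    return a

# Same learning rate for both losses: squared error diverges on the
# unstable trajectory, while the time-weighted log loss recovers a_true.
print("squared-error fit:    ", fit(grad_squared, lr=0.05))
print("time-weighted log fit:", fit(grad_logweighted, lr=0.05))
```

The design point of this sketch is the imbalance the abstract refers to: under squared error, the gradient contribution of the observation at time t scales like a_true^(2t), so a step size that is stable for the late samples barely moves the estimate with respect to the early ones, and any fixed step size eventually diverges as the horizon grows. Weighting each term down in time and passing residuals through a logarithm bounds each observation's influence, so one horizon-independent step size suffices in this toy setting.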