Segmented models are widely used to describe non-stationary sequential data with discrete change points. Their estimation usually requires solving a mixed discrete-continuous optimization problem, where the segmentation is the discrete part and all other model parameters are continuous. A number of estimation algorithms have been developed that are highly specialized for their specific model assumptions. The dependence on non-standard algorithms makes it hard to integrate segmented models in state-of-the-art deep learning architectures that critically depend on gradient-based optimization techniques. In this work, we formulate a relaxed variant of segmented models that enables joint estimation of all model parameters, including the segmentation, with gradient descent. We build on recent advances in learning continuous warping functions and propose a novel family of warping functions based on the two-sided power (TSP) distribution. TSP-based warping functions are differentiable, have simple closed-form expressions, and can represent segmentation functions exactly. Our formulation includes the important class of segmented generalized linear models as a special case, which makes it highly versatile. We use our approach to model the spread of COVID-19 with Poisson regression, apply it on a change point detection task, and learn classification models with concept drift. The experiments show that our approach effectively learns all these tasks with standard algorithms for gradient descent.
翻译:分解模型被广泛用于描述非静止相继数据,并带有离散的变化点。它们的估算通常需要解决一个混合的离散连续优化问题,即分解是离散部分,所有其他模型参数都是连续的。一些估算算法已经开发出来,对其具体的模型假设具有高度的专业性。对非标准算法的依赖使得很难将分解模型纳入高度依赖梯度优化技术的先进深层次学习结构中。在这项工作中,我们制定了一个松散的模型变体,以便能够联合估计所有模型参数,包括分解和梯度下降。我们在学习连续扭曲功能方面最近取得的进展的基础上,提出了基于双向能力分布的新型扭曲函数。基于TSP的扭曲算法功能是不同的,具有简单的封闭式表达方式,可以代表分解功能。我们的构件包括分解的普通直线模型的重要类别,作为特殊案例,因此具有高度的灵活性。我们用我们的方法来模拟COVI-19的扩展和Poisson的回归方法。我们利用了最近的进展,学习了基于双向值分布法的分级模型,并有效地学习了我们的标准递化任务。