This paper focuses on the problem of semi-supervised domain adaptation for time-series forecasting, which is underexplored in literatures, despite being often encountered in practice. Existing methods on time-series domain adaptation mainly follow the paradigm designed for the static data, which cannot handle domain-specific complex conditional dependencies raised by data offset, time lags, and variant data distributions. In order to address these challenges, we analyze variational conditional dependencies in time-series data and find that the causal structures are usually stable among domains, and further raise the causal conditional shift assumption. Enlightened by this assumption, we consider the causal generation process for time-series data and propose an end-to-end model for the semi-supervised domain adaptation problem on time-series forecasting. Our method can not only discover the Granger-Causal structures among cross-domain data but also address the cross-domain time-series forecasting problem with accurate and interpretable predicted results. We further theoretically analyze the superiority of the proposed method, where the generalization error on the target domain is bounded by the empirical risks and by the discrepancy between the causal structures from different domains. Experimental results on both synthetic and real data demonstrate the effectiveness of our method for the semi-supervised domain adaptation method on time-series forecasting.
翻译:本文侧重于时间序列预测的半监督域适应问题,尽管在实践上经常遇到实践,但文献中未充分探讨这个问题。时间序列领域适应的现有方法主要遵循为静态数据设计的范式,即无法处理数据抵消、时间滞后和变式数据分布引起的特定领域复杂有条件依赖性。为了应对这些挑战,我们分析了时间序列数据中因果结构在时间序列数据中的差异性有条件依赖性,发现因果结构通常在域间保持稳定,并进一步提高因果性有条件转移假设。我们根据这一假设,考虑时间序列数据因果生成过程,并为时间序列预测中的半监督域适应问题提出一个端对端模型。我们的方法不仅不能在跨域数据中发现Granger-Causal结构,而且不能解决具有准确和可解释预测结果的跨领域时间序列预测问题。我们进一步从理论上分析了拟议方法的优越性,在目标领域的总体错误受实证风险的约束,并且由不同领域对因果性预测方法的半透明性预测结果展示。