Conventional supervised learning methods typically assume i.i.d samples and are found to be sensitive to out-of-distribution (OOD) data. We propose Generative Causal Representation Learning (GCRL) which leverages causality to facilitate knowledge transfer under distribution shifts. While we evaluate the effectiveness of our proposed method in human trajectory prediction models, GCRL can be applied to other domains as well. First, we propose a novel causal model that explains the generative factors in motion forecasting datasets using features that are common across all environments and with features that are specific to each environment. Selection variables are used to determine which parts of the model can be directly transferred to a new environment without fine-tuning. Second, we propose an end-to-end variational learning paradigm to learn the causal mechanisms that generate observations from features. GCRL is supported by strong theoretical results that imply identifiability of the causal model under certain assumptions. Experimental results on synthetic and real-world motion forecasting datasets show the robustness and effectiveness of our proposed method for knowledge transfer under zero-shot and low-shot settings by substantially outperforming the prior motion forecasting models on out-of-distribution prediction.
翻译:常规监督的学习方法通常假定i.d样本,并被认为对分配外数据敏感。我们提议“创形代表学习”利用因果关系来推动分布变化情况下的知识转让。我们评估了人类轨迹预测模型中拟议方法的有效性,同时,可将其应用于其他领域。首先,我们提出了一个新的因果模型,用以解释运动预测数据集中的基因变异因素,该模型的特征在所有环境中都是共同的,并具有每个环境特有的特征。选择变量用来确定模型的哪些部分可以直接转移到新的环境而不作微调。第二,我们提议了一个端到端变异学习模式,以学习产生特征观测的因果关系机制。GCRL得到强有力的理论结果的支持,这意味着在某些假设下可以识别因果关系模型。合成和真实世界运动预测数据集的实验结果表明,我们提议的零射和低射场知识转让方法的稳健性和有效性,其方法是大大超过预测外向外的先前运动预测模型。