We consider a new problem of adapting a human mesh reconstruction model to out-of-domain streaming videos, where performance of existing SMPL-based models are significantly affected by the distribution shift represented by different camera parameters, bone lengths, backgrounds, and occlusions. We tackle this problem through online adaptation, gradually correcting the model bias during testing. There are two main challenges: First, the lack of 3D annotations increases the training difficulty and results in 3D ambiguities. Second, non-stationary data distribution makes it difficult to strike a balance between fitting regular frames and hard samples with severe occlusions or dramatic changes. To this end, we propose the Dynamic Bilevel Online Adaptation algorithm (DynaBOA). It first introduces the temporal constraints to compensate for the unavailable 3D annotations, and leverages a bilevel optimization procedure to address the conflicts between multi-objectives. DynaBOA provides additional 3D guidance by co-training with similar source examples retrieved efficiently despite the distribution shift. Furthermore, it can adaptively adjust the number of optimization steps on individual frames to fully fit hard samples and avoid overfitting regular frames. DynaBOA achieves state-of-the-art results on three out-of-domain human mesh reconstruction benchmarks.
翻译:我们考虑将人类网目重建模型改造为外部流动视频的新问题,即现有基于SMPL的模型的性能受到不同相机参数、骨长度、背景和隐蔽度代表的分布变化的重大影响。我们通过在线适应,逐步纠正测试中的模型偏差来解决这个问题。有两个主要挑战:第一,缺乏3D说明增加了培训难度和3D模糊度的结果。第二,非静止数据分布使得很难在适合正常框架和具有严重隐蔽或急剧变化的硬样本之间取得平衡。为此,我们提议了动态双级在线适应算法(DynaBOA ) 。它首先引入了时间限制,以弥补无法提供的3D说明,并运用双级优化程序解决多目标之间的冲突。DynaBOA 提供了额外的3D指导,通过在分布变化中有效检索的类似来源实例进行联合培训。此外,它可以适应性地调整单个框架的优化步骤数量,使其完全适合硬标本,并避免过度配置常规框架。DynaBOA在三个基准上实现国家重建。