Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models

Generating realistic motions for digital humans is a core but challenging part of computer animation and games, as human motions are both diverse in content and rich in style. While the latest deep learning approaches have made significant advances in this domain, they mostly treat motion synthesis and style manipulation as two separate problems. This is mainly due to the difficulty of effectively learning, in a common representation, both motion content, which accounts for inter-class behaviour, and style, which accounts for intra-class behaviour. To tackle this challenge, we propose a denoising diffusion probabilistic model solution for styled motion synthesis. As diffusion models have a high capacity brought by the injection of stochasticity, we can represent both inter-class motion content and intra-class style behaviour in the same latent space. This results in an integrated, end-to-end trained pipeline that facilitates the generation of optimal motion and the exploration of a content-style coupled latent space. To achieve high-quality results, we design a multi-task diffusion model architecture that strategically generates aspects of human motion for local guidance. We also design adversarial and physical regularization for global guidance. We demonstrate superior performance with quantitative and qualitative results and validate the effectiveness of our multi-task architecture.
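As background for the "injection of stochasticity" mentioned in the abstract, the following is a minimal sketch of the generic forward (noising) process of a denoising diffusion probabilistic model. It is not the paper's multi-task architecture or its guidance terms. The function names, the flattened `pose` vector, and the linear schedule (1000 steps, beta from 1e-4 to 0.02, the values commonly used in the DDPM literature) are assumptions chosen purely for illustration.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Return a linearly spaced noise schedule beta_1..beta_T (assumed values)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """Return alpha_bar_t, the running product of (1 - beta_s) up to step t."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def q_sample(x0, t, alpha_bars, noise=None):
    """Draw x_t from q(x_t | x_0) = N(sqrt(alpha_bar_t) * x_0, (1 - alpha_bar_t) * I)."""
    if noise is None:
        noise = [random.gauss(0.0, 1.0) for _ in x0]
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    return [signal_scale * x + noise_scale * e for x, e in zip(x0, noise)]


# Hypothetical flattened pose vector (a few joint angles); not taken from the paper.
pose = [0.1, -0.4, 0.7, 0.0]
alpha_bars = cumulative_alphas(linear_beta_schedule(1000))
noisy_pose = q_sample(pose, t=999, alpha_bars=alpha_bars)
```

At the last step `alpha_bars[-1]` is close to zero, so `noisy_pose` is almost pure Gaussian noise. A trained denoising network would then learn to reverse these steps. How content and style conditioning enter that reverse process is specific to the paper and is not shown here.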

