运动驱动: 具有传播模型的文本驱动人类运动生成 (MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model)

Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions conditioned on natural languages. However, it remains challenging to achieve diverse and fine-grained motion generation with various text inputs. To address this problem, we propose MotionDiffuse, the first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods. 1) Probabilistic Mapping. Instead of a deterministic language-motion mapping, MotionDiffuse generates motions through a series of denoising steps in which variations are injected. 2) Realistic Synthesis. MotionDiffuse excels at modeling complicated data distribution and generating vivid motion sequences. 3) Multi-Level Manipulation. MotionDiffuse responds to fine-grained instructions on body parts, and arbitrary-length motion synthesis with time-varied text prompts. Our experiments show MotionDiffuse outperforms existing SoTA methods by convincing margins on text-driven motion generation and action-conditioned motion generation. A qualitative analysis further demonstrates MotionDiffuse's controllability for comprehensive motion generation. Homepage: https://mingyuan-zhang.github.io/projects/MotionDiffuse.html

翻译：人类运动模型对于许多现代图形应用十分重要,这些应用通常需要专业技能。为了消除外行人的技能障碍,最近的运动生成方法可以直接产生以自然语言为条件的人类运动。然而,用各种文字投入实现多样化和精细的动作生成仍然具有挑战性。为解决这一问题,我们提议采用以文字驱动的首次传播模型驱动的动作生成框架,即运动Diffuse,它展示了现有方法的几种期望特性。 (1) 概率映射。它不是用确定性的语言移动映射,而是通过一系列注入变异的分化步骤产生动作。(2) Realistic Asyn综合。运动在模拟复杂的数据分配和生成生动运动序列方面仍然很出色。(3) 多层次的调控。运动是响应以微缩式指令,以及任意的长动动作合成,提示了时间变异的文字。我们的实验显示运动Diffuse 超越了现有的 SoTA方法,方法是在文本驱动的生成和动作生成上说服边缘。(2) Realisticalimmedition Annation/HisimputiveDimation。 assimalalassing:motionDligistrings

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【MM 2021】基于单张图像的多风格说话人合成，Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis

专知会员服务

6+阅读 · 2022年3月22日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日