Recent breakthroughs in Vision-and-Language (V&L) joint research have achieved remarkable results in various text-driven tasks. High-quality Text-to-Video (T2V), a task long considered out of reach, has recently been shown to be feasible with reasonably good results. However, the resulting videos often contain undesired artifacts, largely because such systems are purely data-driven and agnostic to physical laws. To tackle this issue and push T2V further toward high-level physical realism, we present an autonomous data generation technique and an accompanying dataset, which are intended to narrow the gap by supplying a large number of multi-modal, 3D Text-to-Video/Simulation (T2V/S) data. The dataset provides high-resolution 3D physical simulations of both solids and fluids, together with textual descriptions of the simulated physical phenomena. We take advantage of two state-of-the-art physical simulation methods, (i) Incremental Potential Contact (IPC) and (ii) the Material Point Method (MPM), to simulate diverse scenarios, including elastic deformation, material fracture, collision, and turbulence. In addition, we supply high-quality, multi-view rendered videos for the benefit of the T2V, Neural Radiance Fields (NeRF), and related communities. This work is a first step toward fully automated Text-to-Video/Simulation (T2V/S). Live examples and follow-up work are available at https://sites.google.com/view/tpa-net.