We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. TDW enables simulation of high-fidelity sensory data and physical interactions between mobile agents and objects in rich 3D environments. Its distinguishing features include: real-time, near-photorealistic image rendering; a library of objects and environments, with routines for customizing them; generative procedures for efficiently building classes of new environments; high-fidelity audio rendering; realistic physical interactions for a variety of material types, including cloth, liquids, and deformable objects; customizable avatars that embody AI agents; and support for human interaction through VR devices. TDW's API enables multiple agents to interact within a simulation and returns a range of sensor and physics data representing the state of the world. We present initial experiments enabled by TDW in emerging research directions in computer vision, machine learning, and cognitive science, including multi-modal physical scene understanding, physical dynamics prediction, multi-agent interaction, models that learn like a child, and attention studies in humans and neural networks.