Humans naturally change their environment through interactions, e.g., by opening doors or moving furniture. To reproduce such interactions in virtual spaces (e.g., the metaverse), we need to capture and model them, including changes in scene geometry, ideally from egocentric input alone (head camera and body-worn inertial sensors). While the head camera can be used to localize the person in the scene, estimating dynamic object pose is much more challenging. Since the object is often not visible from the head camera (e.g., a person not looking at a chair while sitting down), we cannot rely on visual object pose estimation. Instead, our key observation is that human motion tells us a lot about scene changes. Motivated by this, we present iReplica, the first human-object interaction reasoning method that can track objects and scene changes based solely on human motion. iReplica is an essential first step towards advanced AR/VR applications in immersive virtual universes and can provide human-centric training data to teach machines to interact with their surroundings. Our code, data, and models will be available on our project page at http://virtualhumans.mpi-inf.mpg.de/ireplica/