从单摄像头中抓取 3D 多人类运动 (Scene-Aware 3D Multi-Human Motion Capture from a Single Camera) - 专知论文

会员服务 ·

0

3D · 估计/估计量 · 规范化的 · 塑造 · 缩放 ·

2023 年 1 月 12 日

Scene-Aware 3D Multi-Human Motion Capture from a Single Camera

翻译：从单摄像头中抓取 3D 多人类运动

Diogo Luvizon,Marc Habermann,Vladislav Golyanik,Adam Kortylewski,Christian Theobalt

from arxiv, Github: https://github.com/dluvizon/scene-aware-3d-multi-human

In this work, we consider the problem of estimating the 3D position of multiple humans in a scene as well as their body shape and articulation from a single RGB video recorded with a static camera. In contrast to expensive marker-based or multi-view systems, our lightweight setup is ideal for private users as it enables an affordable 3D motion capture that is easy to install and does not require expert knowledge. To deal with this challenging setting, we leverage recent advances in computer vision using large-scale pre-trained models for a variety of modalities, including 2D body joints, joint angles, normalized disparity maps, and human segmentation masks. Thus, we introduce the first non-linear optimization-based approach that jointly solves for the absolute 3D position of each human, their articulated pose, their individual shapes as well as the scale of the scene. In particular, we estimate the scene depth and person unique scale from normalized disparity predictions using the 2D body joints and joint angles. Given the per-frame scene depth, we reconstruct a point-cloud of the static scene in 3D space. Finally, given the per-frame 3D estimates of the humans and scene point-cloud, we perform a space-time coherent optimization over the video to ensure temporal, spatial and physical plausibility. We evaluate our method on established multi-person 3D human pose benchmarks where we consistently outperform previous methods and we qualitatively demonstrate that our method is robust to in-the-wild conditions including challenging scenes with people of different sizes.

翻译：在这项工作中,我们考虑如何估计多人类在现场的3D位置以及他们的身体形状,以及用一个静态相机录制的单一 RGB 视频来显示他们的身体形状和分解面。与昂贵的基于标记或多视图的系统相比,我们的轻量级设置对私人用户来说是理想的,因为它能够让一个负担得起的3D运动捕捉容易安装,不需要专家知识。为了应对这一具有挑战性的环境,我们利用大规模预先培训模型,在各种模式上利用计算机视觉方面的最新进展,包括2D身体接合、联合角度、标准差异图和人体分解面罩。因此,我们采用了第一个非线性优化法,共同解决每个人绝对的3D位置、其直观、个人形状以及场景的大小。我们特别利用2D身体联合点和联合角度,从正常差异预测中估算出场景的深度和个人独特规模。我们从每个平台的深度深度深度、共同角度重建了3D空间静态场景的点。最后,我们从一个连续的3D 人际空间定位方法到我们连续的深度的深度评估,我们用一个连续的图像和高度的图像模型来评估。

0

相关内容

3D是英文“Three Dimensions”的简称，中文是指三维、三个维度、三个坐标，即有长、有宽、有高，换句话说，就是立体的，是相对于只有长和宽的平面（2D）而言。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

ARHGAP9基因在肝癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

细胞外基质蛋白CTHRC1在HPV16/18型宫颈癌微环境中的调控作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

大规模数据集3D手语识别的研究

国家自然科学基金

1+阅读 · 2014年12月31日

RERT-lncRNA调控EGLN2在肝细胞肝癌发生中的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于3DGIS的山岭隧道动态智能安全预警机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

胎盘滋养层合体化必需分子syncytin单核苷酸多态性对基因转录和蛋白质功能的影响及与妊娠结局相关性的研究

国家自然科学基金

0+阅读 · 2012年12月31日

泛素特异性肽酶22（USP22）对关键基因转录的调控在胃癌发生发展中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

氧化-内质网应激通路在弓形虫感染致胎盘滋养细胞凋亡中的作用及干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP

Arxiv

0+阅读 · 2023年3月8日

RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes

Arxiv

0+阅读 · 2023年3月8日

Proactive Multi-Camera Collaboration For 3D Human Pose Estimation

Arxiv

0+阅读 · 2023年3月7日

A System for Generalized 3D Multi-Object Search

Arxiv

0+阅读 · 2023年3月6日

HybridCap: Inertia-aid Monocular Capture of Challenging Human Motions

Arxiv

0+阅读 · 2023年3月6日

Initial Task Allocation for Multi-Human Multi-Robot Teams with Attention-based Deep Reinforcement Learning

Arxiv

0+阅读 · 2023年3月4日

An Information-Theoretic Characterization of MIMO-FAS: Optimization, Diversity-Multiplexing Tradeoff and $q$-Outage Capacity

Arxiv

0+阅读 · 2023年3月4日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Arxiv

12+阅读 · 2020年2月27日

Monocular Object and Plane SLAM in Structured Environments

Monocular Object and Plane SLAM in Structured Environments

Arxiv

12+阅读 · 2018年9月10日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】面向企业的图学习扩展：生产级图学习与推理，485页pdf

AI智能体编程：技术、挑战与机遇综述

【国家标准】数据安全技术数据安全风险评估方法

【CMU博士论文】交互式学习的进展：替代性反馈机制与自适应因果推理

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

相关论文

CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP

Arxiv

0+阅读 · 2023年3月8日

RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes

Arxiv

0+阅读 · 2023年3月8日

Proactive Multi-Camera Collaboration For 3D Human Pose Estimation

Arxiv

0+阅读 · 2023年3月7日

A System for Generalized 3D Multi-Object Search

Arxiv

0+阅读 · 2023年3月6日

HybridCap: Inertia-aid Monocular Capture of Challenging Human Motions

Arxiv

0+阅读 · 2023年3月6日

Initial Task Allocation for Multi-Human Multi-Robot Teams with Attention-based Deep Reinforcement Learning

Arxiv

0+阅读 · 2023年3月4日

An Information-Theoretic Characterization of MIMO-FAS: Optimization, Diversity-Multiplexing Tradeoff and $q$-Outage Capacity

Arxiv

0+阅读 · 2023年3月4日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Arxiv

12+阅读 · 2020年2月27日

Monocular Object and Plane SLAM in Structured Environments

Monocular Object and Plane SLAM in Structured Environments

Arxiv

12+阅读 · 2018年9月10日

相关基金

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

ARHGAP9基因在肝癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

细胞外基质蛋白CTHRC1在HPV16/18型宫颈癌微环境中的调控作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

大规模数据集3D手语识别的研究

国家自然科学基金

1+阅读 · 2014年12月31日

RERT-lncRNA调控EGLN2在肝细胞肝癌发生中的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于3DGIS的山岭隧道动态智能安全预警机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

胎盘滋养层合体化必需分子syncytin单核苷酸多态性对基因转录和蛋白质功能的影响及与妊娠结局相关性的研究

国家自然科学基金

0+阅读 · 2012年12月31日

泛素特异性肽酶22（USP22）对关键基因转录的调控在胃癌发生发展中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

氧化-内质网应激通路在弓形虫感染致胎盘滋养细胞凋亡中的作用及干预研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员