即时神经体渲染：通过单目RGBD流进行人-物交互的即时体渲染 (Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream) - 专知论文

会员服务 ·

0

交互 · 行人 · 关键帧 · 非刚性 · 实时重建 ·

2023 年 4 月 6 日

Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream

翻译：即时神经体渲染：通过单目RGBD流进行人-物交互的即时体渲染

Yuheng Jiang,Kaixin Yao,Zhuo Su,Zhehao Shen,Haimin Luo,Lan Xu

from arxiv, CVPR 2023

Convenient 4D modeling of human-object interactions is essential for numerous applications. However, monocular tracking and rendering of complex interaction scenarios remain challenging. In this paper, we propose Instant-NVR, a neural approach for instant volumetric human-object tracking and rendering using a single RGBD camera. It bridges traditional non-rigid tracking with recent instant radiance field techniques via a multi-thread tracking-rendering mechanism. In the tracking front-end, we adopt a robust human-object capture scheme to provide sufficient motion priors. We further introduce a separated instant neural representation with a novel hybrid deformation module for the interacting scene. We also provide an on-the-fly reconstruction scheme of the dynamic/static radiance fields via efficient motion-prior searching. Moreover, we introduce an online key frame selection scheme and a rendering-aware refinement strategy to significantly improve the appearance details for online novel-view synthesis. Extensive experiments demonstrate the effectiveness and efficiency of our approach for the instant generation of human-object radiance fields on the fly, notably achieving real-time photo-realistic novel view synthesis under complex human-object interactions.

翻译：方便地进行人-物交互的4D建模对许多应用来说都是至关重要的，但是复杂交互场景的单目跟踪和渲染仍然具有挑战性。本文提出了Instant-NVR，这是一种利用单个RGBD相机进行即时体人-物跟踪和渲染的神经方法。它通过多线程追踪-渲染机制，将传统的非刚性跟踪与最近的即时光度场技术连接起来。在跟踪前端，我们采用鲁棒的人-物捕捉方案来提供足够的运动先验知识。我们进一步引入了一个分离的即时神经表示，并使用新型混合变形模块来发掘交互场景的局部信息。我们还通过高效的运动先验搜索提供了动态/静态光度场的实时重建机制。此外，我们引入了一种在线关键帧选择方案和一种渲染感知的细化策略，以显著提高在线新视图综合的外观细节。广泛的实验表明，我们的方法对于即时生成人-物光度场非常有效和高效，特别是在复杂的人-物交互场景下实现了实时照片般逼真的新视图综合。

0

相关内容

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

专知会员服务

25+阅读 · 2022年3月27日

【斯坦福CVPR2022】EG3D:高效的几何感知三维生成对抗网络，EG3D: Efficient Geometry-aware 3D Generative Adversarial Networks

【斯坦福CVPR2022】EG3D:高效的几何感知三维生成对抗网络，EG3D: Efficient Geometry-aware 3D Generative Adversarial Networks

专知会员服务

18+阅读 · 2022年3月15日

【CVPR 2022】可控图像合成与编辑的合成生成先验学习，SemanticStyleGAN: Learning Compositonal Generative Priors for Controllable Image Synthesis and Editing

【CVPR 2022】可控图像合成与编辑的合成生成先验学习，SemanticStyleGAN: Learning Compositonal Generative Priors for Controllable Image Synthesis and Editing

专知会员服务

23+阅读 · 2022年3月3日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【Google】神经辐射场，Neural Radiance Fields，74页ppt

专知会员服务

74+阅读 · 2021年5月28日

【CVPR2020-斯坦福】从RGB-D扫描对抗纹理优化，Adversarial Texture Optimization

【CVPR2020-斯坦福】从RGB-D扫描对抗纹理优化，Adversarial Texture Optimization

专知会员服务

17+阅读 · 2020年3月21日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

专知会员服务

17+阅读 · 2020年3月9日

近期必读的5篇AI顶会CVPR 2020 GNN (图神经网络) 相关论文

近期必读的5篇AI顶会CVPR 2020 GNN (图神经网络) 相关论文

专知会员服务

79+阅读 · 2020年3月3日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

人脸神经辐射场的掩码编辑方法NeRFFaceEditing，不会三维建模也能编辑立体人脸

人脸神经辐射场的掩码编辑方法NeRFFaceEditing，不会三维建模也能编辑立体人脸

机器之心

0+阅读 · 2022年11月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

AI科技评论

21+阅读 · 2019年6月23日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

【泡泡一分钟】学习紧密的几何特征（ICCV2017-17）

【泡泡一分钟】学习紧密的几何特征（ICCV2017-17）

泡泡机器人SLAM

20+阅读 · 2018年5月8日

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

专知

10+阅读 · 2018年4月22日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

基于虚拟螺旋运动坐标系的捷联速度算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于视频图像处理的神经导航空间配准方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于稀疏表示的在线视觉跟踪

国家自然科学基金

0+阅读 · 2014年12月31日

高精度流固耦合问题的动画生成方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

中心反折射全景相机标定- - 共形几何代数方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于阴影恢复技术的SAR三维重建与目标检测方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于Voronoi图的动态虚拟场景可见性计算方法

国家自然科学基金

0+阅读 · 2010年12月31日

基于2D视频视觉关注度的3D重建方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

单目移动拍摄下基于隐式形状模型的行人检测方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于领域唯一性彩色编码的实时三维视觉方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

NeRFuser: Large-Scale Scene Representation by NeRF Fusion

Arxiv

0+阅读 · 2023年5月22日

SG-GAN: Fine Stereoscopic-Aware Generation for 3D Brain Point Cloud Up-sampling from a Single Image

Arxiv

0+阅读 · 2023年5月22日

Points2Sound: From mono to binaural audio using 3D point cloud scenes

Arxiv

0+阅读 · 2023年5月19日

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields

Arxiv

0+阅读 · 2023年5月19日

ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis

Arxiv

1+阅读 · 2023年5月18日

MonoTDP: Twin Depth Perception for Monocular 3D Object Detection in Adverse Scenes

Arxiv

0+阅读 · 2023年5月18日

Dynamic Matrix Recovery

Arxiv

0+阅读 · 2023年5月17日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Geometric Deep Learning on Molecular Representations

Arxiv

12+阅读 · 2021年7月26日

Low-Shot Learning from Imaginary Data

Arxiv

15+阅读 · 2018年4月3日

VIP会员

文章信息

相关主题

相关VIP内容

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

专知会员服务

25+阅读 · 2022年3月27日

【斯坦福CVPR2022】EG3D:高效的几何感知三维生成对抗网络，EG3D: Efficient Geometry-aware 3D Generative Adversarial Networks

【斯坦福CVPR2022】EG3D:高效的几何感知三维生成对抗网络，EG3D: Efficient Geometry-aware 3D Generative Adversarial Networks

专知会员服务

18+阅读 · 2022年3月15日

【CVPR 2022】可控图像合成与编辑的合成生成先验学习，SemanticStyleGAN: Learning Compositonal Generative Priors for Controllable Image Synthesis and Editing

【CVPR 2022】可控图像合成与编辑的合成生成先验学习，SemanticStyleGAN: Learning Compositonal Generative Priors for Controllable Image Synthesis and Editing

专知会员服务

23+阅读 · 2022年3月3日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【Google】神经辐射场，Neural Radiance Fields，74页ppt

专知会员服务

74+阅读 · 2021年5月28日

【CVPR2020-斯坦福】从RGB-D扫描对抗纹理优化，Adversarial Texture Optimization

【CVPR2020-斯坦福】从RGB-D扫描对抗纹理优化，Adversarial Texture Optimization

专知会员服务

17+阅读 · 2020年3月21日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

专知会员服务

17+阅读 · 2020年3月9日

近期必读的5篇AI顶会CVPR 2020 GNN (图神经网络) 相关论文

近期必读的5篇AI顶会CVPR 2020 GNN (图神经网络) 相关论文

专知会员服务

79+阅读 · 2020年3月3日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

人脸神经辐射场的掩码编辑方法NeRFFaceEditing，不会三维建模也能编辑立体人脸

人脸神经辐射场的掩码编辑方法NeRFFaceEditing，不会三维建模也能编辑立体人脸

机器之心

0+阅读 · 2022年11月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

AI科技评论

21+阅读 · 2019年6月23日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

【泡泡点云时空】基于增量分割的3D点云定位方法（ICRA2018-4）

泡泡机器人SLAM

13+阅读 · 2018年10月7日

【泡泡一分钟】学习紧密的几何特征（ICCV2017-17）

【泡泡一分钟】学习紧密的几何特征（ICCV2017-17）

泡泡机器人SLAM

20+阅读 · 2018年5月8日

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

专知

10+阅读 · 2018年4月22日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

相关论文

NeRFuser: Large-Scale Scene Representation by NeRF Fusion

Arxiv

0+阅读 · 2023年5月22日

SG-GAN: Fine Stereoscopic-Aware Generation for 3D Brain Point Cloud Up-sampling from a Single Image

Arxiv

0+阅读 · 2023年5月22日

Points2Sound: From mono to binaural audio using 3D point cloud scenes

Arxiv

0+阅读 · 2023年5月19日

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields

Arxiv

0+阅读 · 2023年5月19日

ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis

Arxiv

1+阅读 · 2023年5月18日

MonoTDP: Twin Depth Perception for Monocular 3D Object Detection in Adverse Scenes

Arxiv

0+阅读 · 2023年5月18日

Dynamic Matrix Recovery

Arxiv

0+阅读 · 2023年5月17日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Geometric Deep Learning on Molecular Representations

Arxiv

12+阅读 · 2021年7月26日

Low-Shot Learning from Imaginary Data

Arxiv

15+阅读 · 2018年4月3日

相关基金

基于虚拟螺旋运动坐标系的捷联速度算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于视频图像处理的神经导航空间配准方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于稀疏表示的在线视觉跟踪

国家自然科学基金

0+阅读 · 2014年12月31日

高精度流固耦合问题的动画生成方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

中心反折射全景相机标定- - 共形几何代数方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于阴影恢复技术的SAR三维重建与目标检测方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于Voronoi图的动态虚拟场景可见性计算方法

国家自然科学基金

0+阅读 · 2010年12月31日

基于2D视频视觉关注度的3D重建方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

单目移动拍摄下基于隐式形状模型的行人检测方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于领域唯一性彩色编码的实时三维视觉方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员