神经辐射场（NeRF）3D分析：HoloLens轨迹和运动结构相机姿势的比较研究 (A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion) - 专知论文

会员服务 ·

0

NeRF · 3D重建 · HoloLens · 重建 · 辐射场 ·

2023 年 4 月 20 日

A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion

翻译：神经辐射场（NeRF）3D分析：HoloLens轨迹和运动结构相机姿势的比较研究

Miriam Jäger,Patrick Hübner,Dennis Haitz,Boris Jutzi

from arxiv, 7 pages, 5 figures. Will be published in the ISPRS The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences

Neural Radiance Fields (NeRFs) are trained using a set of camera poses and associated images as input to estimate density and color values for each position. The position-dependent density learning is of particular interest for photogrammetry, enabling 3D reconstruction by querying and filtering the NeRF coordinate system based on the object density. While traditional methods like Structure from Motion are commonly used for camera pose calculation in pre-processing for NeRFs, the HoloLens offers an interesting interface for extracting the required input data directly. We present a workflow for high-resolution 3D reconstructions almost directly from HoloLens data using NeRFs. Thereby, different investigations are considered: Internal camera poses from the HoloLens trajectory via a server application, and external camera poses from Structure from Motion, both with an enhanced variant applied through pose refinement. Results show that the internal camera poses lead to NeRF convergence with a PSNR of 25\,dB with a simple rotation around the x-axis and enable a 3D reconstruction. Pose refinement enables comparable quality compared to external camera poses, resulting in improved training process with a PSNR of 27\,dB and a better 3D reconstruction. Overall, NeRF reconstructions outperform the conventional photogrammetric dense reconstruction using Multi-View Stereo in terms of completeness and level of detail.

翻译：神经辐射场（NeRF）通过一组摄像头姿势和相关图像作为输入进行训练，以预测每个位置的密度和颜色值。这种依赖位置的密度学习对光测学特别有用，通过查询和过滤基于对象密度的NeRF坐标系实现3D重建。虽然结构从运动等传统方法通常用于NeRF的预处理中，但HoloLens提供了一种直接提取所需输入数据的有趣接口。我们提出了一个使用NeRF从HoloLens数据直接进行高分辨率3D重建的工作流。我们考虑了不同的调查：通过服务器应用程序从HoloLens轨迹获取的内部相机姿势以及通过姿势细化应用的改进型的外部相机姿势。结果表明，内部相机姿势可导致围绕x轴的简单旋转导致NeRF收敛时达到25 dB的峰值信噪比，从而实现了3D重建。姿势细化使得与外部相机姿势相比具有可比质量，结果为27 dB的峰值信噪比，并实现了更好的3D重建。总体而言，NeRF重建在完整性和细节水平方面优于使用多视图立体视觉进行的传统光学密集重建。

0

相关内容

NeRF

【CVPR 2022】从大量非正式视频中构建可动画的3D神经模型，BANMo: Building Animatable 3D Neural Models from Many Casual Videos

【CVPR 2022】从大量非正式视频中构建可动画的3D神经模型，BANMo: Building Animatable 3D Neural Models from Many Casual Videos

专知会员服务

25+阅读 · 2022年3月3日

【ICML2020-伯克利-马毅老师组】深度等距学习的视觉识别，Deep Isometric Learning for Visual Recognition

【ICML2020-伯克利-马毅老师组】深度等距学习的视觉识别，Deep Isometric Learning for Visual Recognition

专知会员服务

25+阅读 · 2020年7月1日

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

专知会员服务

29+阅读 · 2020年5月19日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020-Oral-牛津-Facebook】从单个图像进行端到端的视图合成，SynSin-View Synthesis

【CVPR2020-Oral-牛津-Facebook】从单个图像进行端到端的视图合成，SynSin-View Synthesis

专知会员服务

29+阅读 · 2020年3月26日

【香港中文大学-CVPR2020】Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images

【香港中文大学-CVPR2020】Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images

专知会员服务

22+阅读 · 2020年3月18日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【论文】结构GANs，Structured GANs，

【论文】结构GANs，Structured GANs，

专知会员服务

15+阅读 · 2020年1月16日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

3D Human相关研究总结：人体、姿态估计、人体重建等

3D Human相关研究总结：人体、姿态估计、人体重建等

PaperWeekly

27+阅读 · 2021年3月1日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

泡泡机器人SLAM

13+阅读 · 2018年12月20日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

【泡泡一分钟】用于深度双目的非监督适应方法(ICCV-2017)

【泡泡一分钟】用于深度双目的非监督适应方法(ICCV-2017)

泡泡机器人SLAM

10+阅读 · 2018年10月7日

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

泡泡机器人SLAM

11+阅读 · 2018年3月31日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

航空叶片多光学传感器多尺度测量点云高效拼合方法

国家自然科学基金

0+阅读 · 2015年12月31日

直线扫描内CT及其动态Bowtie研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于全向深度视觉的高精度人体肢体运动三维重建研究

国家自然科学基金

0+阅读 · 2014年12月31日

量子调控高铝组分氮化物基深紫外LED光提取效率

国家自然科学基金

0+阅读 · 2013年12月31日

双能谱CT迭代重建算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

面向虚拟手术的人体肝脏物理建模方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

城市地区形变测量中的多源传感器四维SAR层析成像

国家自然科学基金

0+阅读 · 2011年12月31日

非圆（任意）轨道锥形投影SPECT解析重建算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于局部不变性特征流的相异场景密集匹配

国家自然科学基金

0+阅读 · 2011年12月31日

基于领域唯一性彩色编码的实时三维视觉方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

Learning Human Mesh Recovery in 3D Scenes

Arxiv

0+阅读 · 2023年6月6日

Deep Active Learning with Structured Neural Depth Search

Arxiv

0+阅读 · 2023年6月5日

Stable Motion Primitives via Imitation and Contrastive Learning

Arxiv

0+阅读 · 2023年6月5日

MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Pose

Arxiv

0+阅读 · 2023年6月4日

PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas

Arxiv

0+阅读 · 2023年6月2日

Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation

Arxiv

0+阅读 · 2023年6月2日

Learning Signed Distance Functions from Noisy 3D Point Clouds via Noise to Noise Mapping

Arxiv

0+阅读 · 2023年6月2日

Pedestrian Crossing Action Recognition and Trajectory Prediction with 3D Human Keypoints

Arxiv

0+阅读 · 2023年6月1日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】从大量非正式视频中构建可动画的3D神经模型，BANMo: Building Animatable 3D Neural Models from Many Casual Videos

【CVPR 2022】从大量非正式视频中构建可动画的3D神经模型，BANMo: Building Animatable 3D Neural Models from Many Casual Videos

专知会员服务

25+阅读 · 2022年3月3日

【ICML2020-伯克利-马毅老师组】深度等距学习的视觉识别，Deep Isometric Learning for Visual Recognition

【ICML2020-伯克利-马毅老师组】深度等距学习的视觉识别，Deep Isometric Learning for Visual Recognition

专知会员服务

25+阅读 · 2020年7月1日

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

专知会员服务

29+阅读 · 2020年5月19日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020-Oral-牛津-Facebook】从单个图像进行端到端的视图合成，SynSin-View Synthesis

【CVPR2020-Oral-牛津-Facebook】从单个图像进行端到端的视图合成，SynSin-View Synthesis

专知会员服务

29+阅读 · 2020年3月26日

【香港中文大学-CVPR2020】Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images

【香港中文大学-CVPR2020】Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images

专知会员服务

22+阅读 · 2020年3月18日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【论文】结构GANs，Structured GANs，

【论文】结构GANs，Structured GANs，

专知会员服务

15+阅读 · 2020年1月16日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

人机协同时代的军事指挥控制演进

《英国智库：瓦解俄罗斯防空系统生产，夺回制空权》最新报告

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

《战术突击工具包：军队的“边缘”操作系统》报告

相关资讯

3D Human相关研究总结：人体、姿态估计、人体重建等

3D Human相关研究总结：人体、姿态估计、人体重建等

PaperWeekly

27+阅读 · 2021年3月1日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

泡泡机器人SLAM

13+阅读 · 2018年12月20日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

【泡泡一分钟】用于深度双目的非监督适应方法(ICCV-2017)

【泡泡一分钟】用于深度双目的非监督适应方法(ICCV-2017)

泡泡机器人SLAM

10+阅读 · 2018年10月7日

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

泡泡机器人SLAM

11+阅读 · 2018年3月31日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

相关论文

Learning Human Mesh Recovery in 3D Scenes

Arxiv

0+阅读 · 2023年6月6日

Deep Active Learning with Structured Neural Depth Search

Arxiv

0+阅读 · 2023年6月5日

Stable Motion Primitives via Imitation and Contrastive Learning

Arxiv

0+阅读 · 2023年6月5日

MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Pose

Arxiv

0+阅读 · 2023年6月4日

PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas

Arxiv

0+阅读 · 2023年6月2日

Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation

Arxiv

0+阅读 · 2023年6月2日

Learning Signed Distance Functions from Noisy 3D Point Clouds via Noise to Noise Mapping

Arxiv

0+阅读 · 2023年6月2日

Pedestrian Crossing Action Recognition and Trajectory Prediction with 3D Human Keypoints

Arxiv

0+阅读 · 2023年6月1日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

相关基金

航空叶片多光学传感器多尺度测量点云高效拼合方法

国家自然科学基金

0+阅读 · 2015年12月31日

直线扫描内CT及其动态Bowtie研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于全向深度视觉的高精度人体肢体运动三维重建研究

国家自然科学基金

0+阅读 · 2014年12月31日

量子调控高铝组分氮化物基深紫外LED光提取效率

国家自然科学基金

0+阅读 · 2013年12月31日

双能谱CT迭代重建算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

面向虚拟手术的人体肝脏物理建模方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

城市地区形变测量中的多源传感器四维SAR层析成像

国家自然科学基金

0+阅读 · 2011年12月31日

非圆（任意）轨道锥形投影SPECT解析重建算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于局部不变性特征流的相异场景密集匹配

国家自然科学基金

0+阅读 · 2011年12月31日

基于领域唯一性彩色编码的实时三维视觉方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员