Surface normal estimation from a single image is an important task in 3D scene understanding. In this paper, we address two limitations shared by the existing methods: the inability to estimate the aleatoric uncertainty and lack of detail in the prediction. The proposed network estimates the per-pixel surface normal probability distribution. We introduce a new parameterization for the distribution, such that its negative log-likelihood is the angular loss with learned attenuation. The expected value of the angular error is then used as a measure of the aleatoric uncertainty. We also present a novel decoder framework where pixel-wise multi-layer perceptrons are trained on a subset of pixels sampled based on the estimated uncertainty. The proposed uncertainty-guided sampling prevents the bias in training towards large planar surfaces and improves the quality of prediction, especially near object boundaries and on small structures. Experimental results show that the proposed method outperforms the state-of-the-art in ScanNet and NYUv2, and that the estimated uncertainty correlates well with the prediction error. Code is available at https://github.com/baegwangbin/surface_normal_uncertainty.
翻译:对单一图像的表面正常估计是3D 场景理解中的一项重要任务。 在本文中, 我们处理现有方法共有的两个限制: 无法估计异常的不确定性和预测中缺乏详细性。 拟议的网络估计 Per- 像素表面的正常概率分布。 我们为分布引入了新的参数化, 这样它的负日志相似性就是通过学习减缩而导致的角值损失。 角误差的预期值随后被作为偏移不确定性的测量尺度使用 。 我们还提出了一个新的解码框架, 用于根据估计的不确定性抽样对像素子组进行像素的训练。 拟议的不确定性制导取样防止了对大型平面的训练偏差, 提高了预测的质量, 特别是在物体边界附近和小型结构上。 实验结果显示, 拟议的方法超过了扫描网和NYUv2 的状态- 。 估计的不确定性与预测错误密切相关。 https://github. commbin_ commagincremaly_ / commagmbaincrematium.