Differentiable renderers provide a direct mathematical link between an object's 3D representation and images of that object. In this work, we develop an approximate differentiable renderer for a compact, interpretable representation, which we call Fuzzy Metaballs. Our approximate renderer focuses on rendering shapes via depth maps and silhouettes. It sacrifices fidelity for utility, producing fast runtimes and high-quality gradient information that can be used to solve vision tasks. Compared to mesh-based differentiable renderers, our method has forward passes that are 5x faster and backwards passes that are 30x faster. The depth maps and silhouette images generated by our method are smooth and defined everywhere. In our evaluation of differentiable renderers for pose estimation, we show that our method is the only one comparable to classic techniques. In shape from silhouette, our method performs well using only gradient descent and a per-pixel loss, without any surrogate losses or regularization. These reconstructions work well even on natural video sequences with segmentation artifacts. Project page: https://leonidk.github.io/fuzzy-metaballs
翻译:可区别的转换器在对象的 3D 表达式和该对象的图像之间提供直接的数学链接。 在这项工作中, 我们为一个缩略、 可解释的缩略图开发了一个大致不同的转换器, 我们称之为 Fuzzy Metaball 。 我们的缩略图侧重于通过深度地图和硅面板来显示形状。 它为实用性牺牲了忠实性, 生成快速运行时间和高品质的梯度信息, 可用于解决视觉任务。 与基于网状的可区分的转换器相比, 我们的方法具有前方传送器, 速速为 30x 的 5x 快速和后向传递器。 我们的方法产生的深度地图和硅面图图像是平滑的, 并且定义无处不在。 在我们对可变形图的缩放器的评估中, 我们显示我们的方法是唯一与经典技术相似的方法。 从硅面的形状中, 我们的方法运行得很好, 仅使用梯度下降和每平流体损失, 没有任何子损失或正规化。 这些重建方法在自然视频序列上运作良好, 甚至与断段制手工艺。 项目页面: https://s://leonid-leonidk.giball. gust./falls.