The Multiplane Image (MPI), containing a set of fronto-parallel RGBA layers, is an effective and efficient representation for view synthesis from sparse inputs. Yet, its fixed structure limits the performance, especially for surfaces imaged at oblique angles. We introduce the Structural MPI (S-MPI), where the plane structure approximates 3D scenes concisely. Conveying RGBA contexts with geometrically-faithful structures, the S-MPI directly bridges view synthesis and 3D reconstruction. It can not only overcome the critical limitations of MPI, i.e., discretization artifacts from sloped surfaces and abuse of redundant layers, and can also acquire planar 3D reconstruction. Despite the intuition and demand of applying S-MPI, great challenges are introduced, e.g., high-fidelity approximation for both RGBA layers and plane poses, multi-view consistency, non-planar regions modeling, and efficient rendering with intersected planes. Accordingly, we propose a transformer-based network based on a segmentation model. It predicts compact and expressive S-MPI layers with their corresponding masks, poses, and RGBA contexts. Non-planar regions are inclusively handled as a special case in our unified framework. Multi-view consistency is ensured by sharing global proxy embeddings, which encode plane-level features covering the complete 3D scenes with aligned coordinates. Intensive experiments show that our method outperforms both previous state-of-the-art MPI-based view synthesis methods and planar reconstruction methods.
翻译:多平面图象(MPI)包含一组有几何信仰的结构、S-MPI直接连接的图像合成和3D重建。它不仅能够克服MPI(即从斜面表面分离的文物和滥用冗余层进行分解)的重大局限性,而且能够进行平面重建。我们引入了结构MPI(S-MPI),其中平面结构近似3D场景。将RGBA环境与几何信仰结构、S-MPI直接连接的图像合成和3D重建相结合。它不仅能够克服MPI(即从斜面上分离的文物和滥用多余层的分解)的关键局限性,而且能够限制其性能。尽管应用S-MPI的直观和需求,但我们引入了3D型平面结构(S-MPI的直观和直观)的性能功能。它预测了常规和直观的S-MPI(S-MPI)系统化的模拟网络,同时通过S-MPI(S-D)的平面图象框架,通过S-BI(S-devical-deal-deal-de-de-de-de-de-comlical-de-de-commal-commal-commal-commal-slation-slation)框架,通过S-smal-commal-commal-commal-commal-commal-commal-s)的系统管理了我们方位图解。我们方框架,通过S-S-smal-smal-smal-s</s>