Feature representation learning is the key recipe for learning-based Multi-View Stereo (MVS). As the common feature extractor of learning-based MVS, vanilla Feature Pyramid Networks (FPN) suffers from discouraged feature representations for reflection and texture-less areas, which limits the generalization of MVS. Even FPNs worked with pre-trained Convolutional Neural Networks (CNNs) fail to tackle these issues. On the other hand, Vision Transformers (ViTs) have achieved prominent success in many 2D vision tasks. Thus we ask whether ViTs can facilitate feature learning in MVS? In this paper, we propose a pre-trained ViT enhanced MVS network called MVSFormer, which can learn more reliable feature representations benefited by informative priors from ViT. Then MVSFormer-P and MVSFormer-H are further proposed with freezed ViT weights and trainable ones respectively. MVSFormer-P is more efficient while MVSFormer-H can achieve superior performance. MVSFormer can be generalized to various input resolutions with the efficient multi-scale training strengthened by gradient accumulation. Moreover, we discuss the merits and drawbacks of classification and regression-based MVS methods, and further propose to unify them with a temperature-based strategy. MVSFormer achieves state-of-the-art performance on the DTU dataset. Particularly, our anonymous submission of MVSFormer is ranked in the Top-1 position on both intermediate and advanced sets of the highly competitive Tanks-and-Temples leaderboard on the day of submission compared with other published works. Codes and models will be released soon.
翻译:以学习为基础的多视系统(MVS)的特征学习是学习性能学习的多视系统(MVS)的关键路由。 由于学习性的MVS(VVT)的共同特征提取器,Vanilla Fature Pyramid网络(FPN)在反射和无纹带地区有令人气馁的特征展示器,这限制了MVS的普及。即使是FPNs也未能解决这些问题。另一方面,愿景变换器(VTs)在许多2D愿景任务中取得了显著成功。因此,我们询问VTs能否促进MVS(VS)的特征学习?在本文件中,我们提议建立一个事先经过训练的VIVS强化的MVS网络(MVS Formerermations),这个网络可以学习更多可靠的特征展示,而VIVS-P和MVS Former-H则进一步提出以冻结的VT重量和训练为基础的标准。MVSF-Feral-S(MVS)的高级性能化和快速变现变现、S(S)的快速变现和不断变现、不断升级的变现、不断变现、不断变现、不断变现、不断的变现、不断的变现、变现、不断变现、不断变现、不断变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的进度、不断的变现、不断变现、不断变现、不断变现、不断变现、不断变现、不断变现、不断的变现、不断的变现、不断的变、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变、不断的变、不断的变、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现、不断的变现的变现、不断的变现、不断的变现、不断的变现、不断的