A virtual or digital tour is a form of virtual reality technology which allows a user to experience a specific location remotely. Currently, these virtual tours are created by following a 2-step strategy. First, a photographer clicks a 360 degree equirectangular image; then, a team of annotators manually links these images for the "walkthrough" user experience. The major challenge in the mass adoption of virtual tours is the time and cost involved in manual annotation/linking of images. Therefore, this paper presents an end-to-end pipeline to automate the generation of 3D virtual tours using equirectangular images for real-estate properties. We propose a novel HSV-based coloring scheme for paper tags that need to be placed at different locations before clicking the equirectangular images using 360 degree cameras. These tags have two characteristics: i) they are numbered to help the photographer for placement of tags in sequence and; ii) bi-colored, which allows better learning of tag detection (using YOLOv5 architecture) in an image and digit recognition (using custom MobileNet architecture) tasks. Finally, we link/connect all the equirectangular images based on detected tags. We show the efficiency of the proposed pipeline on a real-world equirectangular image dataset collected from the Housing.com database.
翻译:虚拟或数字巡演是一种虚拟现实技术形式,使用户能够远程体验特定位置。 目前, 这些虚拟巡演是通过一个两步策略创建的。 首先, 摄影师点击360 度正方形图像; 然后, 一个批注员团队将这些图像手工链接到“ 走过” 用户体验中。 大规模采用虚拟巡演的主要挑战在于人工批注/ 链接图像所需的时间和成本。 因此, 本文展示了一条端到端管道, 以便用地产属性的平方形图像自动生成 3D 虚拟巡演。 我们提议为纸牌标记设计一个新的基于 HSV 的彩色方案, 在使用360度摄像头点击正方形图像之前, 需要放置在不同地点。 这些标签有两个特点 : i) 用于帮助摄影师按顺序排列标签; ii 双色, 用于更好地学习标签检测( 使用 YOLOv5 架构), 用于图像和数字识别( 使用自定义的移动网络结构) 。 最后, 我们链接/ 将基于直方形图像数据库 所收集的全方形图像的效率 。