PanGEA, the Panoramic Graph Environment Annotation toolkit, is a lightweight toolkit for collecting speech and text annotations in photo-realistic 3D environments. PanGEA immerses annotators in a web-based simulation and lets them move around easily as they speak and/or listen. It includes database and cloud storage integration, plus utilities for automatically aligning recorded speech with manual transcriptions and with the annotators' virtual pose. Out of the box, PanGEA supports two tasks: collecting navigation instructions and following navigation instructions. It could also be easily adapted for annotating walking tours, finding and labeling landmarks or objects, and similar tasks. We share best practices learned from using PanGEA in a 20,000-hour annotation effort to collect the Room-Across-Room dataset. We hope that our open-source annotation toolkit and insights will both expedite future data collection efforts and spur innovation in the kinds of grounded language tasks such environments can support.