Transparent objects, such as glass walls and doors, constitute architectural obstacles hindering the mobility of people with low vision or blindness. For instance, the open space behind glass doors is inaccessible, unless it is correctly perceived and interacted with. However, traditional assistive technologies rarely cover the segmentation of these safety-critical transparent objects. In this paper, we build a wearable system with a novel dual-head Transformer for Transparency (Trans4Trans) perception model, which can segment general- and transparent objects. The two dense segmentation results are further combined with depth information in the system to help users navigate safely and assist them to negotiate transparent obstacles. We propose a lightweight Transformer Parsing Module (TPM) to perform multi-scale feature interpretation in the transformer-based decoder. Benefiting from TPM, the double decoders can perform joint learning from corresponding datasets to pursue robustness, meanwhile maintain efficiency on a portable GPU, with negligible calculation increase. The entire Trans4Trans model is constructed in a symmetrical encoder-decoder architecture, which outperforms state-of-the-art methods on the test sets of Stanford2D3D and Trans10K-v2 datasets, obtaining mIoU of 45.13% and 75.14%, respectively. Through a user study and various pre-tests conducted in indoor and outdoor scenes, the usability and reliability of our assistive system have been extensively verified. Meanwhile, the Tran4Trans model has outstanding performances on driving scene datasets. On Cityscapes, ACDC, and DADA-seg datasets corresponding to common environments, adverse weather, and traffic accident scenarios, mIoU scores of 81.5%, 76.3%, and 39.2% are obtained, demonstrating its high efficiency and robustness for real-world transportation applications.
翻译:玻璃墙和门等透明对象构成建筑障碍,妨碍视力低或失明的人的流动。例如,玻璃门后面的开放空间是无法进入的,除非它得到正确的认识和互动。然而,传统辅助技术很少覆盖这些安全关键透明对象的分解。在本文中,我们建立了一个可磨损的系统,配有全新的双头透明变异器(Trans4Trans)感知模型(Trans4Transyer),它可以分割普通和透明对象。两个密集的分解结果与系统内的深度信息进一步结合起来,帮助用户安全导航和谈判透明障碍。我们建议了一个轻量的变异器剖析模块(TPM),用于在基于变异器的解码器的解码器中进行多级地特征解释。 从 TPM 受益的双倍解码解码器可以从相应的数据集中进行联合学习以追求稳健性,同时保持便携式GPUP(Transty)的效能,同时可忽略更多的计算。整个Trans4Trans-Tradefer 模型是在一个对准的编码-coder-decoder 结构中构建一个比对称的, CD CD 和直径的计算系统,在Sdeal-deal-deal-deal-demodal-deal-deal-demodal-deal-deal-deal-deal-deal-deal-deal disal dal disal disal disal disaldaldal-dal-dal-dal-daldaldaldaldaldald-d-daldaldal-d-d-d-d-d-d-d-d-dal-d-d-d-d-d-d-d-d-d-d-d-d-d-d-d-d-dal-disal-dal-d-d-daldaldald-d-d-dal-d-d-d-d-ld-ld-d-d-d-d-d-d-d-d-d-d-d-ld-d-d-d-d-d-d-d-d-d-d-d-