In recent years, due to the wider WiFi coverage and the popularization of mobile communication devices, the technology of indoor positioning using WiFi fingerprints has been rapidly developed. Currently, most supervised methods need to collect a large amount of data to construct fingerprint datasets, which is labor-intensive and time-consuming. In addition, many studies focused on the ideal laboratory environment and lack the consideration in the practical application environment, especially in the scenario of multiple large multi-floor buildings. To solve these problems, we proposed a novel WiDAGCN model which can be trained by a few labeled site survey data and unlabeled crowdsensing WiFi fingerprints. To comprehensively represent the topology structure of the data, we constructed heterogeneous graphs according to the received signal strength indicators (RSSIs) between the waypoints and WiFi access points (APs). Moreover, previous WiFi indoor localization studies rarely involved complete graph feature representation, thus we use graph convolutional network (GCN) to extract graph-level embeddings. There are also some difficult problems, for example, a large amount of unlabeled data that cannot be applied to a supervised model, and the existence of multiple data domains leads to inconsistency in data distribution. Therefore, a semi-supervised domain adversarial training scheme was used to make full use of unlabeled data and align the data distribution of different domains. A public indoor localization dataset containing different buildings was used to evaluate the performance of the model. The experimental results show that our system can achieve a competitive localization accuracy in large buildings such as shopping malls.
翻译:近年来,由于WiFi覆盖面扩大,移动通信设备普及,使用WiFi指纹的室内定位技术迅速得到迅速发展,目前,最受监督的方法需要收集大量数据,以建立指纹数据集,这是劳动密集型和耗时的。此外,许多研究侧重于理想的实验室环境,在实际应用环境中缺乏考虑,特别是在多个大型多层建筑的假设中,为了解决这些问题,我们建议了一个新的WiDAGCN模型,该模型可由少数有标签的现场调查数据和未贴标签的人群WiFi指纹来培训。为了全面反映数据的表层结构,我们根据收到的信号强度指标(RSSIs)和WiFi接入点(APs)之间构建了多种混杂的图形数据集。此外,WiFi的室内本地本地化研究很少涉及完整的图形特征代表,因此我们使用图变动网络来提取模型级嵌入。还存在一些困难,例如,大量未贴标签的数据无法用于监督的本地采购数据,因此,在多层次的市际结构中使用了多种数据分布模式。