In recent years WiFi became the primary source of information to locate a person or device indoor. Collecting RSSI values as reference measurements with known positions, known as WiFi fingerprinting, is commonly used in various positioning methods and algorithms that appear in literature. However, measuring the spatial distance between given set of WiFi fingerprints is heavily affected by the selection of the signal distance function used to model signal space as geospatial distance. In this study, the authors proposed utilization of machine learning to improve the estimation of geospatial distance between fingerprints. This research examined data collected from 13 different open datasets to provide a broad representation aiming for general model that can be used in any indoor environment. The proposed novel approach extracted data features by examining a set of commonly used signal distance metrics via feature selection process that includes feature analysis and genetic algorithm. To demonstrate that the output of this research is venue independent, all models were tested on datasets previously excluded during the training and validation phase. Finally, various machine learning algorithms were compared using wide variety of evaluation metrics including ability to scale out the test bed to real world unsolicited datasets.
翻译:近年来, WiFi 成为内部定位某人或装置的主要信息来源。 收集RSSI 值作为已知位置的参考测量标准,称为 WiFi 指纹,在文献中出现的各种定位方法和算法中常用; 然而,测量特定一组 WiFi 指纹之间的空间距离,由于选择用于模拟信号空间的信号距离功能作为地理空间距离的信号功能而大受影响。 在这项研究中,作者建议利用机器学习来改进对指纹之间地理空间距离的估计。 这项研究审查了13个不同的开放数据集收集的数据,以提供广泛的代表性,旨在为任何室内环境中可以使用的一般模型提供广泛的代表性。 拟议的新办法通过通过特征选择过程,包括特征分析和遗传算法,对一套通用信号距离测量标准进行抽取数据特征。 为了证明这一研究的产出是独立的,所有模型都经过先前在培训和验证阶段排除的数据集的测试。 最后,各种机器学习算法使用各种各样的评价指标进行了比较,包括将测试床缩放到真实世界的不要求的数据集的能力。