In this paper, we consider the IoT data discovery problem in very large and growing scale networks. Through analysis, examples, and experimental studies, we show the importance of peer-to-peer, unstructured routing for IoT data discovery and point out the space efficiency issue that has been overlooked in keyword-based routing algorithms in unstructured networks. Specifically, as the first in the field, this paper investigates routing table designs and various compression techniques to support effective and space-efficient IoT data discovery routing. Novel summarization algorithms, including alphabetical, hash, and meaning-based summarization and their corresponding coding schemes, are proposed. We also consider routing table design to support summarization without degrading lookup efficiency for discovery query routing. The issue of potentially misleading routing due to summarization is also investigated. Subsequently, we analyze the strategy of when to summarize to balance the tradeoff between the routing table compression rate and the chance of causing misleading routing. For the experimental study, we have collected 100K IoT data streams from various IoT databases as the input dataset. Experimental results show that our summarization solution can reduce the routing table size by 20 to 30 folds with a 2-5% increase in latency compared with similar peer-to-peer discovery routing algorithms without summarization. Also, our approach outperforms DHT-based approaches by 2 to 6 folds in terms of latency and traffic.
翻译:在本文中,我们考虑在非常大且日益扩大的规模网络中发现IOT数据的问题。 通过分析、实例和实验研究,我们展示了同行对等、无结构化路径对IOT数据发现的重要性,并指出了在基于关键词的路径算法中被忽视的空间效率问题。具体地说,作为第一个实地,本文件调查了路线设计以及各种压缩技术,以支持有效和空间高效的IOT数据发现路由。通过分析,我们提出了包括字母、散数和基于意义的总和算法及其相应的折叠编码计划在内的新加和算法。我们还考虑在不降低基于关键词的路径算法中支持总和,同时指出在基于关键词的路径算法中忽略了空间效率问题。作为第一个实地的,我们分析了用于平衡基于路径的压缩率和导致误导性路由路由路径的路径选择。关于实验研究,我们收集了来自各种IOT 6 和 5 类比数据数据库的100K 数据流,通过实验性 将20 递增数据格式 将数据转换为20 的折叠式。