Fast Wireless Sensor Anomaly Detection based on Data Stream in Edge Computing Enabled Smart Greenhouse

The edge-computing-enabled smart greenhouse is a representative application of Internet of Things technology: it monitors environmental information in real time and uses that information to support intelligent decision-making. Anomaly detection on wireless sensor data plays an important role in this process. However, traditional anomaly detection algorithms were designed for static data and do not properly account for the inherent characteristics of the data streams that wireless sensors produce, namely infiniteness, correlations, and concept drift. These characteristics pose a considerable challenge for stream-based anomaly detection and lead to low detection accuracy and efficiency. First, a data stream is generated quickly and is effectively infinite, so any traditional offline algorithm that tries to store the whole dataset, or to scan it multiple times, will run out of memory. Second, different data streams are correlated with one another, which traditional algorithms rarely consider. Third, the underlying data generation process or data distribution may change over time, so traditional algorithms that never update their model lose effectiveness. To address these issues, this paper proposes a novel method, called DLSHiForest, based on Locality-Sensitive Hashing and a time window technique, which aims for accurate and efficient detection. Comprehensive experiments on a real-world agricultural greenhouse dataset demonstrate the feasibility of the approach. The experimental results show that the proposal is practical for addressing the challenges that traditional anomaly detection faces while maintaining accuracy and efficiency.
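The abstract names two ingredients, Locality-Sensitive Hashing and a time window, without giving the algorithm itself. The following is a minimal sketch of how those two ideas can combine in a streaming detector; it is not the authors' DLSHiForest. The class name `WindowedLSHDetector`, its parameters, and the scoring rule (one minus the average share of recent points that hash to the same bucket) are all assumptions made for illustration.

```python
import math
import random
from collections import Counter, deque


class WindowedLSHDetector:
    """Illustrative only: LSH bucket counts kept over a sliding window."""

    def __init__(self, dim, window_size=200, num_tables=8, bucket_width=4.0, seed=0):
        rng = random.Random(seed)
        self.window_size = window_size
        self.bucket_width = bucket_width
        # One random projection (a, b) per table: h(x) = floor((a.x + b) / w)
        self.projections = [
            ([rng.gauss(0.0, 1.0) for _ in range(dim)], rng.uniform(0.0, bucket_width))
            for _ in range(num_tables)
        ]
        self.counts = [Counter() for _ in range(num_tables)]
        self.window = deque()  # hash keys of the points currently in the window

    def _keys(self, x):
        return tuple(
            math.floor((sum(ai * xi for ai, xi in zip(a, x)) + b) / self.bucket_width)
            for a, b in self.projections
        )

    def score(self, x):
        """Score in [0, 1]; higher means fewer recent points share x's buckets."""
        if not self.window:
            return 0.0
        keys = self._keys(x)
        n = len(self.window)
        density = sum(c[k] for c, k in zip(self.counts, keys)) / (n * len(keys))
        return 1.0 - density

    def update(self, x):
        """Insert x, then evict the oldest point once the window is full."""
        keys = self._keys(x)
        self.window.append(keys)
        for c, k in zip(self.counts, keys):
            c[k] += 1
        if len(self.window) > self.window_size:
            old = self.window.popleft()
            for c, k in zip(self.counts, old):
                c[k] -= 1
                if c[k] == 0:
                    del c[k]


# Hypothetical (temperature, humidity) readings; the values are made up.
rng = random.Random(42)
detector = WindowedLSHDetector(dim=2, window_size=200)
for _ in range(300):
    detector.update((rng.gauss(25.0, 0.5), rng.gauss(60.0, 0.5)))

normal_score = detector.score((25.0, 60.0))
outlier_score = detector.score((60.0, 10.0))
print(normal_score, outlier_score)
```

The sketch touches the first and third issues raised in the abstract: memory stays bounded because only `window_size` hash keys are stored, and evicting old points lets the bucket counts follow a drifting distribution. Correlation between separate streams is only handled to the extent that correlated readings are fed in as one vector.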

Related content

Anomaly detection

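In data mining, anomaly detection is the identification of items, events, or observations that do not conform to an expected pattern or to the other items in a dataset. Anomalous items typically translate into problems such as bank fraud, structural defects, medical problems, or errors in text. Anomalies are also referred to as outliers, novelties, noise, deviations, and exceptions. In abuse and network intrusion detection in particular, the interesting objects are often not rare objects but unexpected bursts of activity. This pattern does not fit the common statistical definition of an anomaly as a rare object, so many anomaly detection methods (especially unsupervised ones) fail on such data unless it has been aggregated appropriately. A cluster analysis algorithm, by contrast, may be able to detect the micro-clusters that these patterns form. There are three broad categories of anomaly detection methods. [1] Unsupervised methods assume that most instances in the dataset are normal and detect anomalies in unlabeled test data by looking for the instances that fit least well with the rest of the data. Supervised methods require a dataset labeled "normal" and "abnormal" and involve training a classifier (the key difference from many other statistical classification problems is the inherent class imbalance of anomaly detection). Semi-supervised methods build a model of normal behavior from a given normal training dataset, then test the likelihood that a test instance was generated by the learned model.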
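As a small illustration of the unsupervised category described above, the sketch below flags the values that fit least well with the rest of an unlabeled sample. It uses a modified z-score based on the median and the median absolute deviation (MAD). This choice of technique, the function name `mad_scores`, the 3.5 threshold, and the sample readings are all assumptions made for the example, not something the text prescribes.

```python
from statistics import median


def mad_scores(values):
    """Modified z-score of each value, using the median and the MAD."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # Degenerate case: more than half the values are identical.
        return [0.0 for _ in values]
    # 0.6745 rescales the MAD so scores are comparable to ordinary z-scores.
    return [0.6745 * abs(v - med) / mad for v in values]


# Made-up temperature readings with one obvious outlier at index 4.
readings = [21.0, 21.4, 20.9, 21.2, 35.0, 21.1, 20.8]
scores = mad_scores(readings)
flagged = [i for i, s in enumerate(scores) if s > 3.5]
print(flagged)
```

The method relies on the assumption stated in the text that most instances are normal: the median and MAD describe the bulk of the data, so a minority of extreme values barely shifts them.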
