Machine learning (ML) applications continue to revolutionize many domains. In recent years, there has been considerable research interest in building novel ML applications for a variety of Internet of Things (IoT) domains, such as precision agriculture, smart cities, and smart manufacturing. IoT domains are characterized by continuous streams of data originating from diverse, geographically distributed sensors, and they often require a real-time or semi-real-time response. IoT characteristics pose several fundamental challenges to designing and implementing effective ML applications. Sensor/network failures that result in data stream interruptions is one such challenge. Unfortunately, the performance of many ML applications quickly degrades when faced with data incompleteness. Current techniques to handle data incompleteness are based upon data imputation ( i.e., they try to fill-in missing data). Unfortunately, these techniques may fail, especially when multiple sensors' data streams become concurrently unavailable (due to simultaneous sensor failures). With the aim of building robust IoT-coupled ML applications, this paper proposes SECOE, a unique, proactive approach for alleviating potentially simultaneous sensor failures. The fundamental idea behind SECOE is to create a carefully chosen ensemble of ML models in which each model is trained assuming a set of failed sensors (i.e., the training set omits corresponding values). SECOE includes a novel technique to minimize the number of models in the ensemble by harnessing the correlations among sensors. We demonstrate the efficacy of the SECOE approach through a series of experiments involving three distinct datasets. The experimental findings reveal that SECOE effectively preserves prediction accuracy in the presence of sensor failures.
翻译:机器学习( ML) 应用程序继续使许多领域发生革命性。 近几年来,人们相当有兴趣为各种Tings( IoT) 的互联网领域,如精密农业、智能城市和智能制造等建立新的 ML 应用程序。 IoT 域的特征是来自不同地理分布的传感器的数据源不断流,它们往往需要实时或半实时反应。 IoT 特性对设计和实施有效的 ML 应用程序构成若干基本挑战。 Sensor/net 失败导致数据流中断,这是其中的一项挑战。 不幸的是,许多 ML 应用程序的性能在面临数据不完整时会迅速下降。目前处理数据不完整的技术是以数据估算(即它们试图填充缺失的数据)为基础的。不幸的是,这些技术可能会失败,特别是当多个传感器流同时无法使用时(由于同时的传感器故障) 。 为了建立强大的 IoT coupled ML 应用程序, 本文提出了SECOE, 一种独特的、积极主动的方法来减轻潜在的同步传感器失败。 SECOEEE 的模型背后的一个基本想法是,SCOEVE 的精确地展示了Slovereal sendeal rodustrational rodustration rodude 。