Anomaly detection is increasingly important to handle the amount of sensor data in Edge and Fog environments, Smart Cities, as well as in Industry 4.0. To ensure good results, the utilized ML models need to be updated periodically to adapt to seasonal changes and concept drifts in the sensor data. Although the increasing resource availability at the edge can allow for in-situ execution of model training directly on the devices, it is still often offloaded to fog devices or the cloud. In this paper, we propose Local-Optimistic Scheduling (LOS), a method for executing periodic ML model training jobs in close proximity to the data sources, without overloading lightweight edge devices. Training jobs are offloaded to nearby neighbor nodes as necessary and the resource consumption is optimized to meet the training period while still ensuring enough resources for further training executions. This scheduling is accomplished in a decentralized, collaborative and opportunistic manner, without full knowledge of the infrastructure and workload. We evaluated our method in an edge computing testbed on real-world datasets. The experimental results show that LOS places the training executions close to the input sensor streams, decreases the deviation between training time and training period by up to 40% and increases the amount of successfully scheduled training jobs compared to an in-situ execution.
翻译:异常探测对于处理边缘和雾、智能城市以及工业4.0环境中的传感器数据量越来越重要。 为确保取得良好结果,需要定期更新利用的 ML 模型,以适应传感器数据中的季节变化和概念漂移。虽然边缘地区越来越多的资源可以直接在设备上进行现场示范培训,但是仍然经常被卸到雾装置或云层中。在本文中,我们建议采用地方-优化调度(LOS)方法,即执行定期 ML 模式培训工作的方法,接近数据源,而不会超载轻重量边缘装置。培训工作视需要卸载到邻近的节点,并且优化资源消耗以满足培训期的需要,同时仍然确保有足够的资源用于进一步培训处决。这一时间安排是以分散、协作和机会的方式完成的,没有完全了解基础设施和工作量。我们在现实世界数据集的边缘计算测试中评估了我们的方法。实验结果表明,LLOS将培训处决工作贴近输入感官流,在培训时间和训练期之间的偏差率将降低到40个培训期,比培训时间和培训期提高了40个比例。