The calibration of low-cost sensors using machine learning techniques is a methodology widely used nowadays. Although many challenges remain to be solved in the deployment of low-cost sensors for air quality monitoring, low-cost sensors have been shown to be useful in conjunction with high-precision instrumentation. Thus, most research is focused on the application of different calibration techniques using machine learning. Nevertheless, the successful application of these models depends on the quality of the data obtained by the sensors, and very little attention has been paid to the whole data gathering process, from sensor sampling and data pre-processing, to the calibration of the sensor itself. In this article, we show the main sensor sampling parameters, with their corresponding impact on the quality of the resulting machine learning-based sensor calibration and their impact on energy consumption, thus showing the existing trade-offs. Finally, the results on an experimental node show the impact of the data sampling strategy in the calibration of tropospheric ozone, nitrogen dioxide and nitrogen monoxide low-cost sensors. Specifically, we show how a sampling strategy that minimizes the duty cycle of the sensing subsystem can reduce power consumption while maintaining data quality.
翻译:使用机器学习技术校准低成本传感器是当今广泛使用的一种方法。虽然在为空气质量监测部署低成本传感器方面仍有许多挑战有待解决,但低成本传感器已证明与高精度仪器一起有用,因此,大多数研究的重点是利用机器学习应用不同的校准技术,然而,成功应用这些模型取决于传感器获得的数据的质量,而且很少注意整个数据收集过程,从传感器取样和数据预处理到传感器校准本身。在本篇文章中,我们展示了主要的传感器取样参数及其对由此产生的机器学习传感器校准质量的相应影响及其对能源消耗的影响,从而显示了现有的权衡。最后,实验节点的结果显示了数据取样战略在对对对对对流层臭氧、氮氧化物和氮一氧化物低成本传感器校准方面的影响。具体地说,我们展示了尽量减少遥感次子的责任周期的取样战略如何在保持数据质量的同时减少电力消耗。