The last decade has seen an explosion in data sources available for the monitoring and prediction of environmental phenomena. While several inferential methods have been developed that make predictions on the underlying process by combining these data, an optimal sampling design for when additional data is needed to complement those from other heterogeneous sources has not yet been developed. Here, we provide an adaptive spatial design strategy based on a utility function that combines both prediction uncertainty and risk-factor criteria. Prediction uncertainty is obtained through a spatial data fusion approach based on fixed rank kriging that can tackle data with differing spatial supports and signal-to-noise ratios. We focus on the application of low-cost portable sensors, which tend to be relatively noisy, for air pollution monitoring, where data from regulatory stations as well as numeric modeling systems are also available. Although we find that spatial adaptive sampling designs can help to improve predictions and reduce prediction uncertainty, low-cost portable sensors are only likely to be beneficial if they are sufficient in number and quality. Our conclusions are based on a multi-factorial simulation experiment, and on a realistic simulation of pollutants in the Erie and Niagara counties in Western New York.
翻译:过去十年来,用于监测和预测环境现象的现有数据源出现了爆炸性的变化。虽然已经开发了几种预测方法,通过合并这些数据对基本过程作出预测,但还没有为补充来自其他不同来源的数据而需要额外数据时的最佳抽样设计。在这里,我们提供了一种适应性空间设计战略,其基础是结合预测不确定性和风险因素标准的实用功能。预测不确定性是通过基于固定等级的空间数据聚合方法获得的,该方法能够以不同的空间支持和信号到音频比率处理数据。我们侧重于应用低成本便携式传感器,这种传感器往往比较吵闹,用于空气污染监测,其中也有来自监管站和数字模型系统的数据。虽然我们认为空间适应性取样设计有助于改进预测和减少预测不确定性,但低成本的便携式传感器只有在数量和质量都足够的情况下才可能有所助益。我们的结论基于多层次的模拟实验,以及纽约西部Erie和Niagara县的污染物现实模拟。</s>