Monitoring pesticide concentration is very important for public authorities given the major concerns for environmental safety and the likelihood for increased public health risks. An important aspect of this process consists in locating abnormal signals, from a large amount of collected data. This kind of data is usually complex since it suffers from limits of quantification leading to left censored observations, and from the sampling procedure which is irregular in time and space across measuring stations. The present manuscript tackles precisely the issue of detecting spatio-temporal collective anomalies in pesticide concentration levels, and introduces a novel methodology for dealing with spatio-temporal heterogeneity. The latter combines a change-point detection procedure applied to the series of maximum daily values across all stations, and a clustering step aimed at a spatial segmentation of the stations. Limits of quantification are handled in the change-point procedure, by supposing an underlying left-censored parametric model, piece-wise stationary. Spatial segmentation takes into account the geographical conditions, and may be based on river network, wind directions, etc. Conditionally to the temporal segment and the spatial cluster, one may eventually analyse the data and identify contextual anomalies. The proposed procedure is illustrated in detail on a data set containing the prosulfocarb concentration levels in surface waters in Centre-Val de Loire region.
翻译:监测农药浓度对于公共当局非常重要,因为人们对环境安全有重大关切,而且有可能增加公共卫生风险,因此,监测农药浓度对于公共当局非常重要。这一进程的一个重要方面是,从大量收集的数据中找到异常信号,这种数据通常比较复杂,因为它受到量化的限制,导致左侧受审查的观测,而且来自测量台站在时间和空间上不规则的抽样程序。本稿精确地处理在农药浓度水平上发现时空集体异常的问题,并采用新的方法处理时空异质,后者结合了对所有台站每日最高值系列应用的变化点检测程序,以及旨在对台站进行空间分割的集群步骤。量化限制在变化点程序中处理,办法是假定基本的左上层参数参数模型、片度静止。空间分解考虑到地理条件,并可能基于河流网络、风向等。对于时间段和空间集群,最终可以分析数据,并查明空间集群的表面偏差。拟议程序在地面区域中绘制了数据图示,其中含有地面偏差的详细度。