Mapping and monitoring crops is a key step towards sustainable intensification of agriculture and addressing global food security. A dataset like ImageNet that revolutionized computer vision applications can accelerate development of novel crop mapping techniques. Currently, the United States Department of Agriculture (USDA) annually releases the Cropland Data Layer (CDL) which contains crop labels at 30m resolution for the entire United States of America. While CDL is state of the art and is widely used for a number of agricultural applications, it has a number of limitations (e.g., pixelated errors, labels carried over from previous errors and absence of input imagery along with class labels). In this work, we create a new semantic segmentation benchmark dataset, which we call CalCROP21, for the diverse crops in the Central Valley region of California at 10m spatial resolution using a Google Earth Engine based robust image processing pipeline and a novel attention based spatio-temporal semantic segmentation algorithm STATT. STATT uses re-sampled (interpolated) CDL labels for training, but is able to generate a better prediction than CDL by leveraging spatial and temporal patterns in Sentinel2 multi-spectral image series to effectively capture phenologic differences amongst crops and uses attention to reduce the impact of clouds and other atmospheric disturbances. We also present a comprehensive evaluation to show that STATT has significantly better results when compared to the resampled CDL labels. We have released the dataset and the processing pipeline code for generating the benchmark dataset.
翻译:作物测绘和监测是可持续加强农业和解决全球粮食安全问题的关键步骤。像图像网这样的数据集,使计算机视野应用革命化的图像网可以加速开发新型作物绘图技术。目前,美国农业部每年发布作物土地数据图(CDL),其中载有全美国30米分辨率的作物标签。虽然CDL是最新工艺,广泛用于若干农业应用,但它有一些局限性(例如,分解错误、从以往错误中结转的标签和缺少与阶级标签一起输入图像的标签),在这项工作中,我们创建了一个新的语义分割基准数据集,我们称之为CalCROP21, 用于加利福尼亚州中部谷地地区10米空间分辨率的多种作物,使用谷歌地球引擎的稳健图像处理管道和新颖的注意力,但STAT使用重印的(内插的)CDL标签标签来进行培训,但是在利用空间和时,我们能够比CDL更准确地预测了SCDL的图像处理结果。我们利用了SentL的图像模型和时间序列模型来显著地显示目前对大气-脉冲的磁带数据流流流学的关注。