深度NIR:利用深深学习技术生成合成国家清单报告图像和改进水果探测系统的数据集 (deepNIR: Datasets for generating synthetic NIR images and improved fruit detection system using deep learning techniques)

This paper presents datasets utilised for synthetic near-infrared (NIR) image generation and bounding-box level fruit detection systems. It is undeniable that high-calibre machine learning frameworks such as Tensorflow or Pytorch, and large-scale ImageNet or COCO datasets with the aid of accelerated GPU hardware have pushed the limit of machine learning techniques for more than decades. Among these breakthroughs, a high-quality dataset is one of the essential building blocks that can lead to success in model generalisation and the deployment of data-driven deep neural networks. In particular, synthetic data generation tasks often require more training samples than other supervised approaches. Therefore, in this paper, we share the NIR+RGB datasets that are re-processed from two public datasets (i.e., nirscene and SEN12MS) and our novel NIR+RGB sweet pepper(capsicum) dataset. We quantitatively and qualitatively demonstrate that these NIR+RGB datasets are sufficient to be used for synthetic NIR image generation. We achieved Frechet Inception Distance (FID) of 11.36, 26.53, and 40.15 for nirscene1, SEN12MS, and sweet pepper datasets respectively. In addition, we release manual annotations of 11 fruit bounding boxes that can be exported as various formats using cloud service. Four newly added fruits [blueberry, cherry, kiwi, and wheat] compound 11 novel bounding box datasets on top of our previous work presented in the deepFruits project [apple, avocado, capsicum, mango, orange, rockmelon, strawberry]. The total number of bounding box instances of the dataset is 162k and it is ready to use from cloud service. For the evaluation of the dataset, Yolov5 single stage detector is exploited and reported impressive mean-average-precision,mAP[0.5:0.95] results of[min:0.49, max:0.812]. We hope these datasets are useful and serve as a baseline for the future studies.

翻译：本文展示了用于合成近红外(NIR)图像生成的数据集。不可否认的是, 高口径的机器学习框架, 如Tensorflow或Pytorch, 以及大型图像网或COCO数据集, 在加速GPU硬件的帮助下, 将机器学习技术的极限推到了几十年以上。在这些突破中, 高质量的数据集是能够成功模型化和部署数据驱动的深层神经网络的基本构件之一。特别是, 合成数据生成任务通常需要比其他监管方法更多的培训样本。因此, 我们分享了 NIR+RGB数据集, 以及由两个公共数据集( 即, 硝基和 SEN12MS ) 重新处理的大型图像网络数据集。高级的NIR+RGB 辣椒( 树脂) 数据集。我们定量和定性地显示这些 NIR+RGB数据集足以用于合成的40个图像生成。我们实现了Frechelder Rickeral 工作, 以及 IMIS IMS 数据输出 11. 36 数据, 我们的最近的Olex 数据输出数据是。