Recently, the EAGL-I system was developed to rapidly create massive labeled datasets of plants intended to be commonly used by farmers and researchers to create AI-driven solutions in agriculture. As a result, a publicly available plant species recognition dataset composed of 40,000 images with different sizes consisting of 8 plant species was created with the system in order to demonstrate its capabilities. This paper proposes a novel method, called Variably Overlapping Time-Coherent Sliding Window (VOTCSW), that transforms a dataset composed of images with variable size to a 3D representation with fixed size that is suitable for convolutional neural networks, and demonstrates that this representation is more informative than resizing the images of the dataset to a given size. We theoretically formalized the use cases of the method as well as its inherent properties and we proved that it has an oversampling and a regularization effect on the data. By combining the VOTCSW method with the 3D extension of a recently proposed machine learning model called 1-Dimensional Polynomial Neural Networks, we were able to create a model that achieved a state-of-the-art accuracy of 99.9% on the dataset created by the EAGL-I system, surpassing well-known architectures such as ResNet and Inception. In addition, we created a heuristic algorithm that enables the degree reduction of any pre-trained N-Dimensional Polynomial Neural Network and which compresses it without altering its performance, thus making the model faster and lighter. Furthermore, we established that the currently available dataset could not be used for machine learning in its present form, due to a substantial class imbalance between the training set and the test set. Hence, we created a specific preprocessing and a model development framework that enabled us to improve the accuracy from 49.23% to 99.9%.
翻译:最近,EAGL- I 系统开发了快速创建大规模有标签的植物数据集的EAGL-I 系统,该系统旨在快速创建大规模有标签的植物数据集,这些植物将供农民和研究人员普遍使用,以在农业中创建由AI驱动的解决方案。结果,一个由40,000个不同大小的图像组成的公开的植物物种识别数据集,由8个植物物种组成,与该系统一起创建,以展示其能力。本文提出了一个创新方法,名为“易变重叠时间-一致滑动窗口”(VOTCSW),将一个由具有可变体积大小的图像组成的数据集转换成3D 代表器,该数据集的大小为3D,适合卷心神经神经网络网络网络的固定大小,并表明这一表示比将数据集的图像重新缩放到一定大小更为丰富信息。我们理论上将使用该方法的系统及其固有属性正式化,并证明它具有过量的测试效果。 通过将VOTCSW方法与最近提出的3D模型模型的扩展,该模型称为1-D-DIPolnal Comnonial Colnomal Neal netweal Net, 网络可以创建一个模型的模型,我们可以在不具有较轻的模型上实现一个模型的模型的模型。