Plant biomass estimation is critical due to the variability of different environmental factors and crop management practices associated with it. The assessment is largely impacted by the accurate prediction of different environmental sustainability indicators. A robust model to predict sustainability indicators is a must for the biomass community. This study proposes a robust model for biomass sustainability prediction by analyzing sustainability indicators using machine learning models. The prospect of ensemble learning was also investigated to analyze the regression problem. All experiments were carried out on a crop residue data from the Ohio state. Ten machine learning models, namely, linear regression, ridge regression, multilayer perceptron, k-nearest neighbors, support vector machine, decision tree, gradient boosting, random forest, stacking and voting, were analyzed to estimate three biomass sustainability indicators, namely soil erosion factor, soil conditioning index, and organic matter factor. The performance of the model was assessed using cross-correlation (R2), root mean squared error and mean absolute error metrics. The results showed that Random Forest was the best performing model to assess sustainability indicators. The analyzed model can now serve as a guide for assessing sustainability indicators in real time.
翻译:由于不同环境因素和与之相关的作物管理做法的可变性,植物生物量估计至关重要。评估在很大程度上受到不同环境可持续性指标准确预测的影响。预测可持续性指标的稳健模型是生物量群体必须具备的。本研究通过利用机器学习模型分析可持续性指标,提出生物量可持续性预测的稳健模型。还调查了共同学习的前景,以分析回归问题。所有实验都是根据俄亥俄州的作物残留数据进行的。十个机器学习模型,即线性回归、脊柱回归、多层过敏模型、K-近邻、支持矢量机、决定树、梯度加速、随机森林、堆叠和投票模型,经过分析后估计了三种生物量可持续性指标,即土壤侵蚀系数、土壤调节指数和有机物系数。模型的性能是通过交叉交错关系(R2)、根正方差错误和平均绝对误差衡量尺度评估的。结果显示,随机森林是评估可持续性指标的最佳执行模型。分析模型现在可以作为实时评估可持续性指标的指南。