Linear regression models, especially the extended STIRPAT model, are routinely-applied for analyzing carbon emissions data. However, since the relationship between carbon emissions and the influencing factors is complex, fitting a simple parametric model may not be an ideal solution. This paper investigated various nonparametric approaches in statistics and machine learning (ML) for modeling carbon emissions data, including kernel regression, random forest and neural network. We selected data from ten Chinese cities from 2005 to 2019 for modeling studies. We found that neural network had the best performance in both fitting and prediction accuracy, which implies its capability of expressing the complex relationships between carbon emissions and the influencing factors. This study provides a new means for quantitative modeling of carbon emissions research that helps to understand how to characterize urban carbon emissions and to propose policy recommendations for "carbon reduction". In addition, we used the carbon emissions data of Wuhu city as an example to illustrate how to use this new approach.
翻译:线性回归模型,特别是STIRPAT模型扩展,通常应用于分析碳排放数据。然而,由于碳排放和影响因素之间的关系复杂,拟合简单参数模型可能不是理想的解决方案。本文调查了统计学和机器学习中的各种非参数方法来建模碳排放数据,包括核回归,随机森林和神经网络。我们选择了从2005年到2019年间的十个中国城市的数据进行建模研究。我们发现神经网络具有最佳的拟合和预测精度,这表明了它表达复杂的碳排放和影响因素关系的能力。本研究提供了一种用于量化建模碳排放研究的新方法,有助于了解如何表征城市碳排放并提出“减碳”政策建议。此外,我们以芜湖市的碳排放数据为例说明如何使用这种新方法。