Data-driven methods for battery lifetime prediction are attracting increasing attention for applications in which the degradation mechanisms are poorly understood and suitable training sets are available. However, while advanced machine learning and deep learning methods offer high performance with minimal feature engineering, simpler "statistical learning" methods often achieve comparable performance, especially for small training sets, while also providing physical and statistical interpretability. In this work, we use a previously published dataset to develop simple, accurate, and interpretable data-driven models for battery lifetime prediction. We first present the "capacity matrix" concept as a compact representation of battery electrochemical cycling data, along with a series of feature representations. We then create a number of univariate and multivariate models, many of which achieve comparable performance to the highest-performing models previously published for this dataset. These models also provide insights into the degradation of these cells. Our approaches can be used both to quickly train models for a new dataset and to benchmark the performance of more advanced machine learning methods.
翻译:电池使用寿命预测的数据驱动方法正在引起人们对退化机制不易理解和具备适当培训组合的应用的日益重视,然而,虽然先进的机器学习和深层次学习方法提供了高性能,具有最起码的特色工程,但更简单的“统计学习”方法往往能取得可比性能,特别是小型培训组,同时也提供了物理和统计解释性。在这项工作中,我们使用以前出版的数据集来为电池使用寿命预测开发简单、准确和可解释的数据驱动模型。我们首先将“能力矩阵”概念作为电池电子化学循环数据以及一系列特征展示的缩缩写表示。我们随后创建了一些单词和多变量模型,其中许多模型的性能与以前为该数据集公布的最优秀模型相当。这些模型还对这些细胞的退化情况提供了深刻的见解。我们的方法既可以用来快速培训新数据集模型,也可以用来衡量更先进的机器学习方法的性能。