This paper proposes a sparse Bayesian treatment of deep neural networks (DNNs) for system identification. Although DNNs show impressive approximation ability in various fields, several challenges still exist for system identification problems. First, DNNs are known to be too complex that they can easily overfit the training data. Second, the selection of the input regressors for system identification is nontrivial. Third, uncertainty quantification of the model parameters and predictions are necessary. The proposed Bayesian approach offers a principled way to alleviate the above challenges by marginal likelihood/model evidence approximation and structured group sparsity-inducing priors construction. The identification algorithm is derived as an iterative regularized optimization procedure that can be solved as efficiently as training typical DNNs. Furthermore, a practical calculation approach based on the Monte-Carlo integration method is derived to quantify the uncertainty of the parameters and predictions. The effectiveness of the proposed Bayesian approach is demonstrated on several linear and nonlinear systems identification benchmarks with achieving good and competitive simulation accuracy.
翻译:本文建议对深神经网络(DNN)进行稀有的巴伊西亚处理,以便进行系统识别。虽然DNN在各个领域表现出令人印象深刻的近似能力,但在系统识别问题上仍然存在若干挑战。首先,DNN被认为过于复杂,很容易使培训数据过于夸大。第二,选择系统识别输入递减器是非三重性的。第三,模型参数和预测的不确定性是有必要的。拟议的Bayesian方法提供了一个原则性方法,通过边际可能性/模型证据近似和结构化的群集模拟前程构建来减轻上述挑战。识别算法是作为迭接的常规优化程序产生的,可以像培训典型DNN那样有效地解决。此外,基于蒙特-卡洛整合方法的实用计算方法可以量化参数和预测的不确定性。拟议的Bayesian方法的有效性体现在若干线性和非线性系统识别基准上,并实现了良好和竞争性的模拟准确性。