Recent deep-learning models have achieved impressive prediction performance, but often sacrifice interpretability and computational efficiency. Interpretability is crucial in many disciplines, such as science and medicine, where models must be carefully vetted or where interpretation is the goal itself. Moreover, interpretable models are concise and often yield computational efficiency. Here, we propose adaptive wavelet distillation (AWD), a method which aims to distill information from a trained neural network into a wavelet transform. Specifically, AWD penalizes feature attributions of a neural network in the wavelet domain to learn an effective multi-resolution wavelet transform. The resulting model is highly predictive, concise, computationally efficient, and has properties (such as a multi-scale structure) which make it easy to interpret. In close collaboration with domain experts, we showcase how AWD addresses challenges in two real-world settings: cosmological parameter inference and molecular-partner prediction. In both cases, AWD yields a scientifically interpretable and concise model which gives predictive performance better than state-of-the-art neural networks. Moreover, AWD identifies predictive features that are scientifically meaningful in the context of respective domains. All code and models are released in a full-fledged package available on Github (https://github.com/Yu-Group/adaptive-wavelets).
翻译:最近深造模型取得了令人印象深刻的预测性业绩,但往往是牺牲了解释性和计算效率。在科学和医学等许多学科中,解释性至关重要,在科学和医学等许多学科中,模型必须经过仔细审查,或者解释本身是目标本身。此外,可解释的模型简洁,往往产生计算效率。在这里,我们提议采用适应性波子蒸馏法(AWD),这种方法旨在将经过训练的神经网络的信息从经培养的神经网络中提取成波子变。具体来说,AWD惩罚波盘域神经网络特性的特性,以便学习有效的多分辨率波子变。由此产生的模型具有高度预测性、简洁、计算效率,并具有便于解释的特性(如多尺度结构)。我们与域专家密切合作,我们展示AWD如何在两个现实世界环境中应对挑战:宇宙参数的推断和分子-伙伴预测。在这两种情况下,AWD产生一种科学的可解释性和简明模型,使预测性业绩比状态-艺术网络更好。此外,AWD确定预测性模型(例如多尺度结构结构),在Giromat/Giromaimal-commal-com 上,在不同的域中,所有代码/comma-commmmmmmmmmmmmmmmmmmmmmmmmmmmus。