Both brain science and the deep learning communities have the problem of interpreting neural activity. For deep learning, even though we can access all neurons' activity data, interpretation of how the deep network solves the task is still challenging. Although a large amount of effort has been devoted to interpreting a deep network, there is still no consensus of what interpretation is. This paper tries to push the discussion in this direction and proposes an information-theoretic progressive framework to synthesize interpretation. Firstly, we discuss intuitions of interpretation: interpretation is meta-information; interpretation should be at the right level; inducing independence is helpful to interpretation; interpretation is naturally progressive; interpretation doesn't have to involve a human. Then, we build the framework with an information map splitting idea and implement it with the variational information bottleneck technique. After that, we test the framework with the CLEVR dataset. The framework is shown to be able to split information maps and synthesize interpretation in the form of meta-information.
翻译:大脑科学和深层学习社区都存在对神经活动的解读问题。 对于深层次的学习来说,尽管我们可以获取所有神经人的活动数据,但对于深网络如何解决这项任务的解释仍然具有挑战性。虽然已经花费了大量精力来解释深网络,但对于什么是解释仍然没有共识。本文试图将讨论推向这个方向,并提出了一个信息理论进步框架来综合解释。首先,我们讨论解释的直觉:解释是元信息;解释应该处于正确的水平;促使独立有助于解释;解释是自然进步的;解释不必涉及人类。然后,我们用信息地图分裂的想法来构建框架,然后用变异信息瓶颈技术来实施。之后,我们用CLEVR数据集来测试框架。这个框架可以将信息地图和以元信息的形式合成解释。