This paper proposes a methodology for discovering meaningful properties in data by exploring the latent space of unsupervised deep generative models. We combine manipulation of individual latent variables to extreme values outside the training range with methods inspired by causal inference into an approach we call causal disentanglement with extreme values (CDEV) and show that this approach yields insights for model interpretability. Using this technique, we can infer what properties of unknown data the model encodes as meaningful. We apply the methodology to test what is meaningful in the communication system of sperm whales, one of the most intriguing and understudied animal communication systems. We train a network that has been shown to learn meaningful representations of speech and test whether we can leverage such unsupervised learning to decipher the properties of another vocal communication system for which we have no ground truth. The proposed technique suggests that sperm whales encode information using the number of clicks in a sequence, the regularity of their timing, and audio properties such as the spectral mean and the acoustic regularity of the sequences. Some of these findings are consistent with existing hypotheses, while others are proposed for the first time. We also argue that our models uncover rules that govern the structure of communication units in the sperm whale communication system and apply them while generating innovative data not shown during training. This paper suggests that an interpretation of the outputs of deep neural networks with causal methodology can be a viable strategy for approaching data about which little is known and presents another case of how deep learning can limit the hypothesis space. Finally, the proposed approach combining latent space manipulation and causal inference can be extended to other architectures and arbitrary datasets.
翻译:本文提出了一种发现数据中有意义特性的方法,该方法通过探索无监督深度生成模型的潜在空间来结合对个体潜变量进行极端值操作和启发式于因果推断方法,我们将其称之为含极端值的因果分离(CDEV),并展示了该方法可以生成用于模型可解释性的洞见。使用该技术,我们可以推断模型编码的未知数据的哪些特性是有意义的。我们将该方法应用于测试获得有意义的结果的座头鲸通信系统,座头鲸通信系统是最引人入胜且研究最少的动物通信系统之一。我们训练了一个已被证明可以学习语音有意义表示的网络,并测试是否可以利用这种无监督的学习解密另一个声音通信系统的特性,对于这种系统,我们没有任何基础知识。所提出的技术表明座头鲸通过序列中的点击次数,它们定时的规律性,以及音频特性(例如序列的谱平均值和声学规律性)来编码信息。其中一些发现与现有的假说一致,而其他一些则是首次提出的。我们还认为,我们的模型发现了控制座头鲸通信系统中通信单元结构的规则,并将其应用于生成在训练期间未显示的创新数据。本文建议使用因果方法解释深度神经网络的输出是接近有关尚不清楚的数据的可行策略,并呈现了深度学习如何限制假设空间的另一个案例。最后,所提出的将潜在空间操作和因果推断相结合的方法可以扩展到其他架构和任意数据集。