Drawing from memory the face of a friend you have not seen in years is a difficult task. However, if you happen to cross paths, you would easily recognize each other. The biological memory is equipped with an impressive compression algorithm that can store the essential, and then infer the details to match perception. Willshaw's model of Associative memory is a likely candidate for a computational model of this brain function, but its application on real-world data is hindered by the so-called Sparse Coding Problem. Due to a recently proposed sparse encoding prescription [31], which maps visual patterns into binary feature maps, we were able to analyze the behavior of the Willshaw Network (WN) on real-world data and gain key insights into the strengths of the model. To further enhance the capabilities of the WN, we propose the Multiple-Modality architecture. In this new setting, the memory stores several modalities (e.g., visual, or textual) simultaneously. After training, the model can be used to infer missing modalities when just a subset is perceived, thus serving as a flexible framework for learning tasks. We evaluated the model on the MNIST dataset. By storing both the images and labels as modalities, we were able to successfully perform pattern completion, classification, and generation with a single model.
翻译:绘制多年没见过的朋友的记忆,这是一个困难的任务。 但是, 如果您碰巧遇到交叉路径, 您就会很容易认出对方。 生物记忆配有令人印象深刻的压缩算法, 可以存储基本数据, 然后推导细节以匹配感知。 Willshaw 的组合记忆模型可能是该大脑功能计算模型的候选对象, 但是它在现实世界数据中的应用受到所谓的“ 粗略编码问题” 的阻碍。 由于最近提出的稀疏编码处方[31], 将视觉模式映射为二进制特征地图, 我们得以分析威尔肖网络(WN)在现实世界数据上的行为, 并获得对模型长处的关键洞察力。 为了进一步增强网络的能力, 我们提出了多模式结构。 在这个新环境中, 记忆储存了多种模式( 如视觉或文字 ) 。 培训后, 模型可以用来推断缺失模式, 只要看到一个子集, 从而作为学习任务的灵活框架, 就能分析威尔肖网络(WN) 在真实世界数据上的行为, 并获取模型的长处洞察力。 我们评估了模型, 将它作为单个的模型, 并成功存储了生成模式。 。 通过存储了一个模型, 将它作为单一的模型, 和单个的模型, 将它作为一种模式, 保存。