与深神经网络一起学习如何代表和预测群集 (Learning to Represent and Predict Sets with Deep Neural Networks)

In this thesis, we develop various techniques for working with sets in machine learning. Each input or output is not an image or a sequence, but a set: an unordered collection of multiple objects, each object described by a feature vector. Their unordered nature makes them suitable for modeling a wide variety of data, ranging from objects in images to point clouds to graphs. Deep learning has recently shown great success on other types of structured data, so we aim to build the necessary structures for sets into deep neural networks. The first focus of this thesis is the learning of better set representations (sets as input). Existing approaches have bottlenecks that prevent them from properly modeling relations between objects within the set. To address this issue, we develop a variety of techniques for different scenarios and show that alleviating the bottleneck leads to consistent improvements across many experiments. The second focus of this thesis is the prediction of sets (sets as output). Current approaches do not take the unordered nature of sets into account properly. We determine that this results in a problem that causes discontinuity issues with many set prediction tasks and prevents them from learning some extremely simple datasets. To avoid this problem, we develop two models that properly take the structure of sets into account. Various experiments show that our set prediction techniques can significantly benefit over existing approaches.

翻译：在此论文中,我们开发了与机器学习中的数据集合作的各种技巧。每一个输入或输出都不是图像或序列, 而是一组: 一个没有顺序的多个对象的集合, 每个对象都是由特性矢量描述的。它们没有顺序的性质使得它们适合于建模各种各样的数据, 从图像中的天体到点云层到图形。深层次的学习最近在其他类型的结构化数据上显示了巨大的成功, 因此我们的目标是为各组建立必要的结构化结构进入深层的神经网络。这个理论的第一个焦点是学习更精确的表达方式( 作为输入的设置) 。现有的方法有瓶颈, 阻止它们正确建模集中对象之间的关系。为了解决这个问题, 我们为不同的情景开发了各种各样的技术, 并表明减轻瓶颈可以导致许多实验的一致改进。这个理论的第二个重点是对各组的预测( 设定为输出 ) 。目前的方法并不适当地考虑到各组的不顺序化性质。我们确定, 这个问题导致不连续的问题, 许多设定了预测任务, 并阻止它们学习一些极其简单的数据集。为了避免这个模型, 我们设置了两个模型。正确的计算出我们现有的方法。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【图与几何深度学习】Graph and geometric deep learning，49页ppt

专知会员服务

65+阅读 · 2021年4月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

《动手学深度学习》(Dive into Deep Learning)PyTorch实现

专知会员服务

121+阅读 · 2019年12月31日