Structured prediction in natural language processing (NLP) has a long history. The complex models of structured application come at the difficulty of learning and inference. These difficulties lead researchers to focus more on models with simple structure components (e.g., local classifier). Deep representation learning has become increasingly popular in recent years. The structure components of their method, on the other hand, are usually relatively simple. We concentrate on complex structured models in this dissertation. We provide a learning framework for complicated structured models as well as an inference method with a better speed/accuracy/search error trade-off. The dissertation begins with a general introduction to energy-based models. In NLP and other applications, an energy function is comparable to the concept of a scoring function. In this dissertation, we discuss the concept of the energy function and structured models with different energy functions. Then, we propose a method in which we train a neural network to do argmax inference under a structured energy function, referring to the trained networks as "inference networks" or "energy-based inference networks". We then develop ways of jointly learning energy functions and inference networks using an adversarial learning framework. Despite the inference and learning difficulties of energy-based models, we present approaches in this thesis that enable energy-based models more easily to be applied in structured NLP applications.
翻译:自然语言处理(NLP)的结构化预测具有悠久的历史。结构化应用的复杂模型是学习和推断的困难所在。这些困难导致研究人员更加关注结构化组成部分的模型(例如地方分类师)。深层代表性学习近年来越来越受欢迎。其方法的结构组成部分通常比较简单。我们在这个论文中集中关注复杂的结构化模型。我们为复杂的结构化模型提供了一个学习框架,并提供了一种推论方法,具有更好的速度/准确性/搜索错误取舍。消化方法始于对能源模型的一般介绍。在NLP和其他应用中,一种能源功能与评分功能的概念相似。在这种评分中,我们讨论了能源功能的概念和结构化模型的不同能源功能。然后,我们提出一种方法,在结构化能源功能下,我们训练一个神经网络,在结构化能源功能下,把经过训练的网络称为“推导的网络”或“基于能源的误判”应用模型。我们随后在对抗性网络中学习了一种方法。我们用这种方法来学习这种能源能力模型。我们用这种方法来学习这种理论,然后又在共同学习能源的网络中学习这种方法。