This thesis develops the translation between category theory and computational linguistics as a foundation for natural language processing. The three chapters deal with syntax, semantics and pragmatics. First, string diagrams provide a unified model of syntactic structures in formal grammars. Second, functors compute semantics by turning diagrams into logical, tensor, neural or quantum computation. Third, the resulting functorial models can be composed to form games where equilibria are the solutions of language processing tasks. This framework is implemented as part of DisCoPy, the Python library for computing with string diagrams. We describe the correspondence between categorical, linguistic and computational structures, and demonstrate their applications in compositional natural language processing.
翻译:这个理论发展了分类理论和计算语言之间的翻译,作为自然语言处理的基础。 这三章涉及语法、语义和实用。 首先, 字符串图提供了正式语法结构的统一模型。 第二, 配方计算词义, 将图表转换成逻辑、 发声、 神经或量子计算。 第三, 由此形成的调情模型可以组成游戏, 使语言处理任务的解决办法是平衡。 这个框架作为DisCoPy( 用字符串图计算时使用的 Python 图书馆)的一部分来实施。 我们描述直截了当、 语言和计算结构之间的对应关系, 并在组成自然语言处理中演示其应用 。