Text Simplification is an ongoing problem in Natural Language Processing, solution to which has varied implications. In conjunction with the TSAR-2022 Workshop @EMNLP2022 Lexical Simplification is the process of reducing the lexical complexity of a text by replacing difficult words with easier to read (or understand) expressions while preserving the original information and meaning. This paper explains the work done by our team "teamPN" for English sub task. We created a modular pipeline which combines modern day transformers based models with traditional NLP methods like paraphrasing and verb sense disambiguation. We created a multi level and modular pipeline where the target text is treated according to its semantics(Part of Speech Tag). Pipeline is multi level as we utilize multiple source models to find potential candidates for replacement, It is modular as we can switch the source models and their weight-age in the final re-ranking.
翻译:文本简化是自然语言处理的一个持续问题,其解决办法具有不同影响。与TSAR-2022 研讨会@EMNLP2022 Lexical简化是减少文本的词汇复杂性的过程,办法是用易于阅读(或理解)的表达方式取代困难的词句,同时保留原始信息和含义。本文解释了我们的“团队”团队为英语次级任务所做的工作。我们创建了一个模块化管道,将基于现代变压器的模型与传统的NLP 方法(如参数转换和动词分辨)结合起来。我们创建了一个多层次和模块化管道,根据该管道的语义(语言标记的一部分)处理目标文本。管道是多层次的,因为我们利用多种源模型寻找可能的替代对象,所以它是模块化的,因为我们可以转换源模型及其在最后重新排位中的重量值。