We introduce Texar, an open-source toolkit that aims to support the broad set of text generation tasks that transform any input into natural language, such as machine translation, summarization, dialog, and content manipulation. Designed for modularity, versatility, and extensibility, Texar extracts the common patterns underlying these diverse tasks and methodologies, provides a library of highly reusable modules and functionalities, and supports arbitrary model architectures and algorithmic paradigms. In Texar, model architecture, losses, and learning processes are fully decomposed, and modules at a high conceptual level can be freely assembled, plugged in, or swapped out. These features make Texar particularly suitable for researchers and practitioners who need fast prototyping and experimentation, and they foster technique sharing across different text generation tasks. We provide case studies to demonstrate the use and advantages of the toolkit. Texar is released under the Apache License 2.0 at https://github.com/asyml/texar.