We release an open library, called TextBox, which provides a unified, modularized, and extensible text generation framework. TextBox aims to support a broad set of text generation tasks and models. In TextBox, we implements several text generation models on benchmark datasets, covering the categories of VAE, GAN, pre-trained language models, etc. Meanwhile, our library maintains sufficient modularity and extensibility by properly decomposing the model architecture, inference, learning process into highly reusable modules, which allows easily incorporating new models into our framework. It is specially suitable for researchers and practitioners to efficiently reproduce baseline models and develop new models. TextBox is implemented based on PyTorch, and released under Apache License 2.0 at the link \url{https://github.com/RUCAIBox/TextBox}.
翻译:我们推出了一个名为TextBox的开放图书馆,它提供了一个统一、模块化和可扩展的文本生成框架。TextBox旨在支持一套广泛的文本生成任务和模型。在TextBox中,我们实施了几个关于基准数据集的文本生成模型,涵盖VAE、GAN、预先培训的语言模型等类别。与此同时,我们的图书馆保持了足够的模块性和可扩展性,将模型结构、推论、学习过程适当地分解为高度可重复使用的模块,从而便于将新模型纳入我们的框架。研究人员和从业人员特别适合高效复制基线模型和开发新模型。TextBox以PyTorrch为基础实施,并在链接\url{https://github.com/RUCAIBox/TextBox}的Apachelic 2.0下发布。