Machine reading comprehension has been intensively studied in recent years, and neural network-based models have achieved dominant performance. In this paper, we present the Sogou Machine Reading Comprehension (SMRC) toolkit, which supports fast and efficient development of modern machine comprehension models, including both published models and original prototypes. To achieve this goal, the toolkit provides dataset readers, a flexible preprocessing pipeline, necessary neural network components, and built-in models, which together make the whole process of data preparation, model construction, and training easier.