NLP Workbench is a web-based platform for text mining that allows non-expert users to obtain semantic understanding of large-scale corpora using state-of-the-art text mining models. The platform is built upon latest pre-trained models and open source systems from academia that provide semantic analysis functionalities, including but not limited to entity linking, sentiment analysis, semantic parsing, and relation extraction. Its extensible design enables researchers and developers to smoothly replace an existing model or integrate a new one. To improve efficiency, we employ a microservice architecture that facilitates allocation of acceleration hardware and parallelization of computation. This paper presents the architecture of NLP Workbench and discusses the challenges we faced in designing it. We also discuss diverse use cases of NLP Workbench and the benefits of using it over other approaches. The platform is under active development, with its source code released under the MIT license. A website and a short video demonstrating our platform are also available.
翻译:NLP Workbench是一个基于网络的文本开采平台,使非专家用户能够利用最新文本开采模型对大型公司进行语义理解,该平台以学术界提供语义分析功能的最新预先培训模型和开放源码系统为基础,包括但不限于实体连接、情绪分析、语义分解和关系提取。该平台的扩展设计使研究人员和开发商能够顺利地替换现有模型或整合新的模型。为了提高效率,我们采用了一种微观服务架构,便利加速硬件的分配和计算平行。本文介绍了NLP Workbench的架构,并讨论了我们在设计过程中面临的挑战。我们还讨论了NLP Workbench的各种使用案例,以及将其用于其他方法的好处。该平台正在积极开发中,其源代码根据麻省理工学院许可证发布。还有一个网站和一个短视频,展示了我们的平台。</s>