WikiDominer:维基百科维基百科 (WikiDoMiner: Wikipedia Domain-specific Miner)

We introduce WikiDoMiner, a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (DOI: 10.5281/zenodo.6671357).

翻译：我们引入了维基Dominer( WikiDominer ), 这是一个通过爬行维基百科自动生成特定域子公司的工具。维基Dominer 帮助工程师要求创建与特定要求规格(RS) 基本领域相关的外部知识资源。能够建立这种资源很重要, 因为特定域数据集稀缺。维基Dominer 首次从给定的RS 中提取一套特定域子, 并询问维基百科获取这些关键词。维基Dominer 的输出是一套与输入RS 领域相关的维基百科文章。用于特定域知识的开采维基百科可以用于多项要求工程任务, 例如, 模糊性处理、要求分类和问题解答。维基Dominer 在Zenodo 上公开提供公开源许可证( DOI: 105281/zenodo.6671357 )。

相关内容

关注 0

该杂志提供了一个重点，传播关于软件密集型信息系统或应用程序需求的获取、表示和验证的新结果。欢迎提交理论和应用性意见，但所有文件都必须明确说明： - 这些思想对复杂系统设计的实际影响 - 思考型实践者应该如何评价这些想法《华尔街日报》的动机是一种多学科的观点，这种观点不仅考虑了软件组件规范方面的需求，而且还考虑了在组织和社会环境中进行的激发、表示和同意需求的活动。为此，人们从软件工程、信息系统、职业社会学、认知和组织心理学、人机交互、计算机支持的合作工作、语言学和哲学等领域寻求贡献，以解决具体的需求工程问题。官网链接：http://dblp.uni-trier.de/db/journals/re/

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【论文翻译】NLP注意力机制综述论文翻译，Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

专知会员服务

96+阅读 · 2020年4月18日