OrbWeaver, an automatic knowledge extraction system paired with a human interface, streamlines the use of unintuitive natural language processing software for modeling systems from their documentation. OrbWeaver enables the indirect transfer of knowledge about legacy systems by leveraging open source tools in document understanding and processing as well as using web based user interface constructs. By design, OrbWeaver is scalable, extensible, and usable; we demonstrate its utility by evaluating its performance in processing a corpus of documents related to advanced persistent threats in the cyber domain. The results indicate better knowledge extraction by revealing hidden relationships, linking co-related entities, and gathering evidence.
翻译:OrbWeaver是一个自动知识提取系统,配有人际界面,它简化了使用不直观的自然语言处理软件,用于从文档中建模系统。OrbWeaver通过在文件理解和处理中利用开放源码工具以及使用基于网络的用户界面结构,间接转让有关遗留系统的知识。根据设计,OrbWeaver可以伸缩、推广和使用;我们通过评价其在处理一系列与网络领域长期存在的先进威胁有关的文件方面的性能来证明它的效用。结果显示,通过揭示隐藏的关系、连接相关实体和收集证据,可以更好地提取知识。