The growing quantity and complexity of data pose challenges for humans to consume information and respond in a timely manner. For businesses in domains with rapidly changing rules and regulations, failure to identify changes can be costly. In contrast to expert analysis or the development of domain-specific ontology and taxonomies, we use a task-based approach for fulfilling specific information needs within a new domain. Specifically, we propose to extract task-based information from incoming instance data. A pipeline constructed of state of the art NLP technologies, including a bi-LSTM-CRF model for entity extraction, attention-based deep Semantic Role Labeling, and an automated verb-based relationship extractor, is used to automatically extract an instance level semantic structure. Each instance is then combined with a larger, domain-specific knowledge graph to produce new and timely insights. Preliminary results, validated manually, show the methodology to be effective for extracting specific information to complete end use-cases.
翻译:与专家分析或开发特定领域的本体学和分类学不同,我们采用基于任务的方法在新的领域满足具体的信息需求。具体地说,我们提议从新案例数据中提取基于任务的信息。我们提议从新案例数据中提取基于任务的信息。由先进的NLP技术构成的管道,包括用于实体提取的双-LSTM-CRF模型、基于关注的深层语义作用定位和基于自动动词的关系提取器,用于自动提取实例级语义结构。然后,每个实例与更大的特定领域知识图相结合,产生新的和及时的洞察力。经过验证的手工手动初步结果显示为完成最终使用案例而提取具体信息的有效方法。