Incident management is a critical part of the DevOps processes for developing and operating large-scale services in the cloud. Incident reports filed by customers are largely unstructured making any automated diagnosis or mitigation non-trivial. It requires on-call engineers to parse verbose reports to understand the issue and locate key information. Prior work has looked into extraction of key attributes or entities like error codes, tenant Ids, stack traces, etc. from incident and bug reports. Although a flat list of entities is informative, to unlock the full potential of knowledge extraction, it is necessary to provide context to these entities. For instance, the relations between the real-world concepts or objects that these entities represent in otherwise unstructured data is useful for downstream tasks like incident linking, triaging and mitigation. With this additional context, entities are transformed from "Strings" to "Things". In this work, we present an approach to mine and score binary entity relations from co-occurring entity pairs. We evaluate binary relations extracted and show that our approach has a high precision of 0.9. Further, we construct knowledge graphs automatically and show that the implicit knowledge in the graph can be used to mine and rank relevant entities for distinct incidents, by mapping entities to clusters of incident titles.
翻译:事件管理是DevOps在云层中开发和运行大型服务的关键部分。客户提交的事故报告基本上没有结构化,无法进行自动诊断或缓解非三重数据。 它要求待命工程师分析动词报告,以了解问题和找到关键信息。 先前的工作是从事件和错误报告中提取关键属性或实体, 如错误代码、 租户代号、 堆积痕迹等。 虽然一个实体平板名单信息丰富, 能够释放全部的知识提取潜力, 但有必要为这些实体提供背景。 例如, 这些实体在非结构化数据中所代表的真实世界概念或对象之间的关系, 有助于下游任务, 如事件连接、 三角和缓解。 在这种额外的背景下, 各实体从“ 环” 转换为“ 线索 ” 。 在这项工作中, 我们提出了一个方法, 从共同实体对立的对立对立中分数。 我们评估了二进制关系, 并表明我们的方法非常精确 0. 9。 此外, 我们自动构建了知识图表, 并显示这些实体在事件分类中的隐含性知识, 可以用来绘制与地雷有关的实体的图表。