In this work, we share our experience on tele-knowledge pre-training for fault analysis. Fault analysis is a vital task for tele-application, which should be timely and properly handled. Fault analysis is also a complex task, that has many sub-tasks. Solving each task requires diverse tele-knowledge. Machine log data and product documents contain part of the tele-knowledge. We create a Tele-KG to organize other tele-knowledge from experts uniformly. With these valuable tele-knowledge data, in this work, we propose a tele-domain pre-training model KTeleBERT and its knowledge-enhanced version KTeleBERT, which includes effective prompt hints, adaptive numerical data encoding, and two knowledge injection paradigms. We train our model in two stages: pre-training TeleBERT on 20 million telecommunication corpora and re-training TeleBERT on 1 million causal and machine corpora to get the KTeleBERT. Then, we apply our models for three tasks of fault analysis, including root-cause analysis, event association prediction, and fault chain tracing. The results show that with KTeleBERT, the performance of task models has been boosted, demonstrating the effectiveness of pre-trained KTeleBERT as a model containing diverse tele-knowledge.
翻译:在这项工作中,我们分享了自己在远程知识前培训错误分析方面的经验。 过失分析是远程应用的重要任务,应该及时和妥善处理。 过失分析也是一项复杂的任务,有许多子任务。 解决每项任务需要不同的远程知识。 机器日志数据和产品文件包含远程知识的一部分。 我们创建了远程KG, 以统一组织专家提供的其他远程知识。 有了这些宝贵的远程知识数据, 我们在此工作中提出了远程培训前模型 KTELBERT及其知识强化版本 KTELEBERT, 其中包括有效的即时提示、适应性数字数据编码和两个知识注入模式。 我们分两个阶段培训我们的模型: 对2 000万个电信公司进行预先培训 TeleBERT, 对100万个因果和机器公司进行再培训,以获得KTELERT。 然后, 我们运用了我们的模型来进行三项错误分析, 包括根源分析、事件关联预测和错误链追踪。 成果显示, KERBERT 的模型具有增强作用, 以演示了多样化的模型。