Failures in optical network backbone can lead to major disruption of internet data traffic. Hence, minimizing such failures is of paramount importance for the network operators. Even better, if the network failures can be predicted and preventive steps can be taken in advance to avoid any disruption in traffic. Various data driven and machine learning techniques have been proposed in literature for failure prediction. Most of these techniques need real time data from the networks and also need different monitors to measure key optical parameters. This means provision for failure prediction has to be available in network nodes, e.g., routers and network management systems. However, sometimes deployed networks do not have failure prediction built into their initial design but subsequently need arises for such mechanisms. For such systems, there are two key challenges. Firstly, statistics of failure distribution, data, etc., are not readily available. Secondly, major changes cannot be made to the network nodes which are already commercially deployed. This paper proposes a novel implementable non-intrusive failure prediction mechanism for deployed network nodes using information from log files of those devices. Numerical results show that the mechanism has near perfect accuracy in predicting failures of individual network nodes.
翻译:光学网络主干网的故障可能导致互联网数据传输的重大中断。 因此, 最大限度地减少这类故障对网络操作员至关重要。 更好的是, 如果可以预测网络故障, 并且可以提前采取预防性步骤, 以避免交通中断, 文献中提出了各种数据驱动和机器学习技术, 以预测故障。 这些技术大多需要网络实时数据, 还需要不同的监测器来测量关键光学参数。 这意味着, 网络节点( 如路由器和网络管理系统)必须提供故障预测。 但是, 有时, 部署的网络在最初设计中没有失败预测, 但随后需要这类机制。 对于这些系统, 有两个关键挑战: 首先, 无法随时获得关于故障分布、 数据等的统计数据。 其次, 无法对已经商业上部署的网络节点进行重大修改。 本文提出了一个新颖的可执行的非入侵性故障预测机制, 用于利用这些装置的日志文档中的信息部署网络节点。 数字结果显示, 机制在预测单个网络节点的故障方面几乎准确。