Fast and accurate performance analysis techniques are essential in early design space exploration and pre-silicon evaluations, including software eco-system development. In particular, on-chip communication continues to play an increasingly important role as the many-core processors scale up. This paper presents the first performance analysis technique that targets networks-on-chip (NoCs) that employ weighted round-robin (WRR) arbitration. Besides fairness, WRR arbitration provides flexibility in allocating bandwidth proportionally to the importance of the traffic classes, unlike basic round-robin and priority-based arbitration. The proposed approach first estimates the effective service time of the packets in the queue due to WRR arbitration. Then, it uses the effective service time to compute the average waiting time of the packets. Next, we incorporate a decomposition technique to extend the analytical model to handle NoC of any size. The proposed approach achieves less than 5% error while executing real applications and 10% error under challenging synthetic traffic with different burstiness levels.
翻译:快速和准确的性能分析技术对于早期设计空间探索和硅前评估至关重要,包括软件生态系统开发。特别是,芯片通信随着多个核心处理器的扩大,继续发挥越来越重要的作用。本文件介绍了针对使用加权圆环(WRR)仲裁的芯片上网络(NOC)的第一个性能分析技术。除了公平外,WRR仲裁在根据交通级别的重要性按比例分配带宽方面提供了灵活性,这与基本圆环和优先级仲裁不同。拟议方法首先估计了排队中包件的有效服务时间,因为WRR仲裁。然后,它利用有效服务时间计算包件的平均等待时间。接下来,我们采用了分解技术,将分析模型扩大到处理任何大小的NOC。拟议方法的误差不到5%,同时在以不同易爆程度挑战合成交通中执行实际应用和误差10%。