Recently, Graphcore has introduced an IPU Processor for accelerating machine learning applications. The architecture of the processor has been designed to achieve state of the art performance on current machine intelligence models for both training and inference. In this paper, we report on a benchmark in which we have evaluated the performance of IPU processors on deep neural networks for inference. We focus on deep vision models such as ResNeXt. We report the observed latency, throughput and energy efficiency.
翻译:最近,Greacore推出了一个加速机器学习应用的议会联盟处理器,该处理器的结构设计是为了在目前的培训和推理机情报模型上取得最新业绩,我们在本文件中报告一个基准,我们在这个基准中评估了议会联盟处理器在深度神经网络上的推理性能,我们侧重于ResNeXt等深视模型,我们报告了观察到的潜伏、吞吐量和能效。