Challenging the Nvidia monopoly, dedicated AI-accelerator chips have begun emerging for tackling the computational challenge that the inference and, especially, the training of modern deep neural networks (DNNs) poses to modern computers. The field has been ridden with studies assessing the performance of these contestants across various DNN model types. However, AI-experts are aware of the limitations of current DNNs and have been working towards the fourth AI wave which will, arguably, rely on more biologically inspired models, predominantly on spiking neural networks (SNNs). At the same time, GPUs have been heavily used for simulating such models in the field of computational neuroscience, yet AI-chips have not been tested on such workloads. The current paper aims at filling this important gap by evaluating multiple, cutting-edge AI-chips (Graphcore IPU, GroqChip, Nvidia GPU with Tensor Cores and Google TPU) on simulating a highly biologically detailed model of a brain region, the inferior olive (IO). This IO application stress-tests the different AI-platforms for highlighting architectural tradeoffs by varying its compute density, memory requirements and floating-point numerical accuracy. Our performance analysis reveals that the simulation problem maps extremely well onto the GPU and TPU architectures, which for networks of 125,000 cells leads to a 28x respectively 1,208x speedup over CPU runtimes. At this speed, the TPU sets a new record for largest real-time IO simulation. The GroqChip outperforms both platforms for small networks but, due to implementing some floating-point operations at reduced accuracy, is found not yet usable for brain simulation.
翻译:在Nvidia的垄断中,专门的AI加速器芯片已经开始出现,用于应对现代计算机所面临的计算挑战。 实地的研究已经遍及这些竞争者在各种DNN模式类型中的性能评估。 然而,AI专家意识到当前DNN的局限性,并一直致力于第四个AI波,这可以说,主要依靠的是蒸发神经网络(SNN)的快速模型。 与此同时,GPU大量用于模拟计算神经科学领域的此类模型,特别是培训现代深神经网络(DNNN),然而,AIchips尚未在此类工作量上测试。 目前的文件旨在填补这一重要差距,方法是评估多个、尖端的AI-chips(Graphore IMU、GroqChip、Nvidia GPUPER 和Google TPU)的局限性,用来模拟一个高度生物详细的大脑区域流动模型,但较低级的橄榄(IO)运行。这个IO应用GPU值模型的运行压力-tor-model 运行图,用来显示我们不同的I-ral IMF 的IMF IMF IMF IMF 。