The everlasting demand for higher computing power for deep neural networks (DNNs) drives the development of parallel computing architectures. 3D integration, in which chips are integrated and connected vertically, can further increase performance because it introduces another level of spatial parallelism. Therefore, we analyze dataflows, performance, area, power and temperature of such 3D-DNN-accelerators. Monolithic and TSV-based stacked 3D-ICs are compared against 2D-ICs. We identify workload properties and architectural parameters for efficient 3D-ICs and achieve up to 9.14x speedup of 3D vs. 2D. We discuss area-performance trade-offs. We demonstrate applicability as the 3D-IC draws similar power as 2D-ICs and is not thermal limited.
翻译:深海神经网络对更高计算能力的长期需求驱动着平行计算结构的发展。 3D集成(芯片是集成和垂直连接的)可以进一步提高性能,因为它引入了另一种空间平行水平。 因此,我们分析3D-DNN加速器的数据流、性能、面积、功率和温度。单体和基于TSV的堆叠型3D-ICS与2D-ICS相比。我们为高效的3D-ICS确定工作量属性和建筑参数,并实现最多9.14x3D对2D的加速。我们讨论区域性能权衡。我们证明3D-IC的可适用性与2D-IC系统相似,且不受热限。