Recent advances in reprogrammable hardware (e.g., FPGAs) and memory technology (e.g., DDR4, HBM) promise to solve performance problems inherent to graph processing like irregular memory access patterns on traditional hardware (e.g., CPU). While several of these graph accelerators were proposed in recent years, it remains difficult to assess their performance and compare them on common graph workloads and accelerator platforms, due to few open source implementations and excessive implementation effort. In this work, we build on a simulation environment for graph processing accelerators, to make several existing accelerator approaches comparable. This allows us to study relevant performance dimensions such as partitioning schemes and memory technology, among others. The evaluation yields insights into the strengths and weaknesses of current graph processing accelerators along these dimensions, and features a novel in-depth comparison.
翻译:在可重新规划硬件(如FPGAs)和记忆技术(如DDR4、HBM)方面,最近的进展有望解决像传统硬件(如CPU)不规则的内存访问模式等图形处理过程所固有的性能问题,尽管近年来提出了其中若干个图形加速器,但由于开放源码执行和过度执行,仍然难以评估其性能并在通用图表工作量和加速器平台上进行比较。在这项工作中,我们利用了图形处理加速器的模拟环境,使现有的几种加速器方法具有可比性。这使我们能够研究相关的性能层面,例如分割计划和记忆技术等。评价有助于了解这些层面当前图形处理加速器的长处和短处,并具有新的深入比较。