Motivated by the increasing interest in the posit numeric format, in this paper we evaluate the accuracy and efficiency of posit arithmetic in contrast to the traditional IEEE 754 32-bit floating-point (FP32) arithmetic. We first design and implement a Posit Arithmetic Unit (PAU), called POSAR, with flexible bit-sized arithmetic suitable for applications that can trade accuracy for savings in chip area. Next, we analyze the accuracy and efficiency of POSAR with a series of benchmarks including mathematical computations, ML kernels, NAS Parallel Benchmarks (NPB), and Cifar-10 CNN. This analysis is done on our implementation of POSAR integrated into a RISC-V Rocket Chip core in comparison with the IEEE 754-based Floting Point Unit (FPU) of Rocket Chip. Our analysis shows that POSAR can outperform the FPU, but the results are not spectacular. For NPB, 32-bit posit achieves better accuracy than FP32 and improves the execution by up to 2%. However, POSAR with 32-bit posit needs 30% more FPGA resources compared to the FPU. For classic ML algorithms, we find that 8-bit posits are not suitable to replace FP32 because they exhibit low accuracy leading to wrong results. Instead, 16-bit posit offers the best option in terms of accuracy and efficiency. For example, 16-bit posit achieves the same Top-1 accuracy as FP32 on a Cifar-10 CNN with a speedup of 18%.
翻译:基于对正数格式的日益浓厚的兴趣,本文件中我们对照传统的 IEEE 754 32 位浮动点(FP32)算术,评估了正数算术的准确性和效率。我们首先设计和实施一个称为POSAR的POSAR(POSAR)软化的比特规模算术,该算术适合于能够交易精度以节省芯片领域的节省。接着,我们用一系列基准,包括数学计算、ML内核、NAS平行基准(NPB)和Cifar-10CNN,来分析POSAR的准确性和效率。我们首先对POSAR的安装与RISC-V火箭芯片核心部分(FP32)的安装进行了分析。我们的分析表明,POSAR能够比芯片领域节省的节省量更精确性更灵活,但结果并不惊人。对于NPB,32比PL的准确性要好,并且将执行率提高到2 %。然而,POSAR的PAR和32比比PG的精度需要增加30%的精度。比值,比值要达到18 %的PGA资源,比值要比值要比值要比值要低,因为我们找到了的精度要比值要低。