Fully Homomorphic Encryption over the Torus (TFHE) allows arbitrary computations to happen directly on ciphertexts using homomorphic logic gates. However, each TFHE gate on state-of-the-art hardware platforms such as GPUs and FPGAs is extremely slow ($>0.2ms$). Moreover, even the latest FPGA-based TFHE accelerator cannot achieve high energy efficiency, since it frequently invokes expensive double-precision floating point FFT and IFFT kernels. In this paper, we propose a fast and energy-efficient accelerator, MATCHA, to process TFHE gates. MATCHA supports aggressive bootstrapping key unrolling to accelerate TFHE gates without decryption errors by approximate multiplication-less integer FFTs and IFFTs, and a pipelined datapath. Compared to prior accelerators, MATCHA improves the TFHE gate processing throughput by $2.3\times$, and the throughput per Watt by $6.3\times$.
翻译:Torus (TFHE) 的全基因加密允许使用同质逻辑门直接在密码文本上进行任意计算。 但是, GPUs 和 FPGAs 等最先进的硬件平台上的每个TFHE门都非常慢( 0. 2 亿美元 ) 。 此外, 即使是最新的基于 FPGA 的 FTHE 加速器也不可能实现高能效, 因为它经常引用昂贵的双精度浮动点 FFFT 和 FFFFT 内核。 在本文中, 我们提议用一个快速和节能的加速器MATCHA 来处理TFHE 门。 MATCHA 支持通过粗略的无倍增倍整FFFTs 和 IFFFTs 和管道数据路径, 加速TFHE门的快速和节能控制器。 与前加速器相比, MATCHA 改进了THE门的处理量为2.3\ times, 和每Watt 的过量为6.3\time $ 。