Homomorphic encryption (HE) enables secure offloading of computation to the cloud by providing computation on encrypted data (ciphertexts). HE is based on noisy encryption schemes such that noise accumulates as we apply more computation to the data. The limited number of operations applicable to the data prevents practical applications from exploiting HE. Bootstrapping enables an unlimited number of operations or fully HE (FHE) by refreshing the ciphertext. Unfortunately, bootstrapping requires a significant amount of additional computation and memory bandwidth. Prior works have proposed hardware accelerators for computation primitives of FHE. However, to the best of our knowledge, this is the first to propose a hardware FHE accelerator tailored to support bootstrapping efficiently. In particular, we propose BTS -- Bootstrappable, Technology-driven, Secure accelerator architecture for FHE. We identify the challenges of supporting bootstrapping in the accelerator and analyze the off-chip memory bandwidth and computation required. In particular, given the limitations of modern memory technology, we identify the HE parameter sets that are efficient for FHE acceleration. Based on the insights from our analysis, we propose BTS that effectively exploits parallelism innate in HE operations by arranging a massive number of processing elements in a grid. We present the design and microarchitecture of BTS, including the network-on-chip that exploits the deterministic communication pattern. BTS shows 5,556$\times$ and 1,306$\times$ improved execution time on ResNet-20 and logistic regression over CPU, using 373.6mm$^2$ chip area and up to 133.8W of power.
翻译:计算机加密( HH) 通过提供加密数据( 密码文本) 的计算, 能够安全地卸载计算云层的计算 。 HE 是基于噪音加密计划, 这样噪音会随着我们对数据进行更多的计算而累积。 适用于数据的操作数量有限, 使得无法实际应用 HE 。 启动使操作数量无限, 或完全 HE (FHE) 能够刷新密码。 不幸的是, 靴子需要大量额外的计算和记忆带宽。 先前的工程提出了用于计算FHE原始数据的硬件加速器 。 然而, 据我们所知, 这是首次提出一个用于高效支持制动的 FHE 加速器硬件。 特别是, 我们建议 BTS - 启动、 技术驱动、 安全加速器架构。 我们确定了支持在加速器中进行踢踏的难度, 并分析离子存储存储器存储器的值, 特别是由于现代记忆技术的局限性, 我们确定了用于FHEE 加速系统运行的HE 加速器 。 我们提议在快速化电路段中, 将一个快速化的计算模型进行 。