As the demand for machine learning-based inference increases in tandem with concerns about privacy, there is a growing recognition of the need for secure machine learning, in which secret models can be used to classify private data without the model or data being leaked. Fully Homomorphic Encryption (FHE) allows arbitrary computation to be done over encrypted data, providing an attractive approach to providing such secure inference. While such computation is often orders of magnitude slower than its plaintext counterpart, the ability of FHE cryptosystems to do \emph{ciphertext packing} -- that is, encrypting an entire vector of plaintexts such that operations are evaluated elementwise on the vector -- helps ameliorate this overhead, effectively creating a SIMD architecture where computation can be vectorized for more efficient evaluation. Most recent research in this area has targeted regular, easily vectorizable neural network models. Applying similar techniques to irregular ML models such as decision forests remains unexplored, due to their complex, hard-to-vectorize structures. In this paper we present COPSE, the first system that exploits ciphertext packing to perform decision-forest inference. COPSE consists of a staging compiler that automatically restructures and compiles decision forest models down to a new set of vectorizable primitives for secure inference. We find that COPSE's compiled models outperform the state of the art across a range of decision forest models, often by more than an order of magnitude, while still scaling well.
翻译:随着对基于机器学习的推断的需求随着对隐私的关注而增加,人们日益认识到需要安全的机器学习,在这种学习中,可以使用秘密模型对私人数据进行分类而不泄漏模型或数据。完全单调加密(FHE)允许对加密数据进行任意计算,为提供这种安全推断提供了一种吸引的办法。虽然这种计算往往比普通的对口系统规模慢,但FHE加密系统进行计算的能力仍然不那么高,因为其复杂、硬至感官化结构。在这个文件中,我们经常加密整个纯文本的矢量,这样对矢量的操作进行精度评价 -- 有助于改善这一管理,有效地建立一个SIMD结构,可以对加密数据进行矢量化,以便进行更有效的评价。这一领域最近进行的大多数研究都针对常规的、易于传导的神经网络模型。对诸如决策森林等非常规的ML模型应用类似的技术,由于它们复杂、难到感应变结构。我们介绍COPSE(COPSE),第一个系统是利用加密的缩缩写模型,同时对决定的缩放模型,然后在CEUSE(CE)中进行一个自动的系统。