Since machine learning algorithms have proven their success in data mining tasks, the data with sensitive information enforce privacy preserving machine learning algorithms to emerge. Moreover, the increase in the number of data sources and the high computational power required by those algorithms force individuals to outsource the training and/or the inference of a machine learning model to the clouds providing such services. To address this dilemma, we propose a secure 3-party computation framework, CECILIA, offering privacy preserving building blocks to enable more complex operations privately. Among those building blocks, we have two novel methods, which are the exact exponential of a public base raised to the power of a secret value and the inverse square root of a secret Gram matrix. We employ CECILIA to realize the private inference on pre-trained recurrent kernel networks, which require more complex operations than other deep neural networks such as convolutional neural networks, on the structural classification of proteins as the first study ever accomplishing the privacy preserving inference on recurrent kernel networks. The results demonstrate that we perform the exact and fully private exponential computation, which is done by approximation in the literature so far. Moreover, we can also perform the exact inverse square root of a secret Gram matrix computation up to a certain privacy level, which has not been addressed in the literature at all. We also analyze the scalability of CECILIA to various settings on a synthetic dataset. The framework shows a great promise to make other machine learning algorithms as well as further computations privately computable by the building blocks of the framework.
翻译:自机器学习算法证明在数据挖掘任务中取得了成功以来,敏感信息数据使隐私保存机学习算法得以出现;此外,数据源数量的增加和这些算法所要求的高计算能力迫使个人将机器学习模型的培训和/或推断外包给提供此类服务的云层;为解决这一难题,我们提议了一个安全的三方计算框架,即CECLIA,提供隐私保护构件,以便能够私下进行更复杂的作业;在这些构件中,我们有两种新方法,即公共基地的精确指数指数增长到秘密价值和秘密Gram矩阵反正平方根的威力。我们利用CECLIA实现预先训练的经常性内核网络的私人推断,这需要比其他深层神经网络(如革命神经网络)更复杂的操作。我们建议对蛋白进行结构分类,作为第一次研究,在经常性内核网络上进一步保护隐私的推断。结果显示,我们进行精确和完全私人指数计算,这是通过对文献的精确度进行接近而完全的平方根根根根根。我们也可以在实验室上进行精确的直径的计算。