Confidential multi-stakeholder machine learning (ML) allows multiple parties to perform collaborative data analytics while not revealing their intellectual property, such as ML source code, model, or datasets. State-of-the-art solutions based on homomorphic encryption incur a large performance overhead. Hardware-based solutions, such as trusted execution environments (TEEs), significantly improve the performance in inference computations but still suffer from low performance in training computations, e.g., deep neural networks model training, because of limited availability of protected memory and lack of GPU support. To address this problem, we designed and implemented Perun, a framework for confidential multi-stakeholder machine learning that allows users to make a trade-off between security and performance. Perun executes ML training on hardware accelerators (e.g., GPU) while providing security guarantees using trusted computing technologies, such as trusted platform module and integrity measurement architecture. Less compute-intensive workloads, such as inference, execute only inside TEE, thus at a lower trusted computing base. The evaluation shows that during the ML training on CIFAR-10 and real-world medical datasets, Perun achieved a 161x to 1560x speedup compared to a pure TEE-based approach.
翻译:保密的多利益攸关方机器学习(ML)使多个方得以在不透露其知识产权(如ML源代码、模型或数据集)的情况下进行协作数据分析,而同时又不透露其知识产权(如ML源代码、模型或数据集)。基于同质加密的先进解决方案产生了巨大的性能管理。基于硬件的解决方案,如可信赖的执行环境(TEE),大大改进了推论计算性能,但仍在培训计算中表现不佳,例如,深神经网络模型培训,因为保护记忆有限,缺少GPU支持。为解决这一问题,我们设计和实施了保密的多利益攸关方机器学习框架秘鲁,使用户能够在安全和性能之间实现交易。秘鲁对硬件加速器(例如GPU)实施ML培训,同时使用可信赖的计算技术(如可信赖的平台模块和完整性测量架构)提供安全保障。低度的密集工作量,如误判,只在TEEE公司内部执行,因此在低信任的计算机基地执行。评价显示,在ML培训期间,用户可以对安全和性能进行交易。