Cooperative learning, that enables two or more data owners to jointly train a model, has been widely adopted to solve the problem of insufficient training data in machine learning. Nowadays, there is an urgent need for institutions and organizations to train a model cooperatively while keeping each other's data privately. To address the issue of privacy-preserving in collaborative learning, secure outsourced computation and federated learning are two typical methods. Nevertheless, there are many drawbacks for these two methods when they are leveraged in cooperative learning. For secure outsourced computation, semi-honest servers need to be introduced. Once the outsourced servers collude or perform other active attacks, the privacy of data will be disclosed. For federated learning, it is difficult to apply to the scenarios where vertically partitioned data are distributed over multiple parties. In this work, we propose a multi-party mixed protocol framework, ABG$^n$, which effectively implements arbitrary conversion between Arithmetic sharing (A), Boolean sharing (B) and Garbled-Circuits sharing (G) for $n$-party scenarios. Based on ABG$^n$, we design a privacy-preserving multi-party cooperative learning system, which allows different data owners to cooperate in machine learning in terms of data security and privacy-preserving. Additionally, we design specific privacy-preserving computation protocols for some typical machine learning methods such as logistic regression and neural networks. Compared with previous work, the proposed method has a wider scope of application and does not need to rely on additional servers. Finally, we evaluate the performance of ABG$^n$ on the local setting and on the public cloud setting. The experiments indicate that ABG$^n$ has excellent performance, especially in the network environment with low latency.
翻译:合作学习使两个或两个以上数据所有者能够联合培训模型,已经广泛采用合作学习,以解决机器学习培训数据不足的问题。如今,各机构和组织迫切需要合作培训模型,同时将彼此的数据私自保存。为了解决合作学习中的隐私保护问题,安全的外包计算和联合学习是两种典型方法。然而,在合作学习中利用这两种方法时,这两类方法有许多缺点。为了安全外包计算,需要采用半诚实服务器。一旦外包服务器串通或执行其他主动攻击,数据隐私将被披露。对于联合学习来说,很难将模型用于合作培训模式,同时将彼此的数据私自保存在多个缔约方之间。在这项工作中,我们提出多党混合协议框架,AB$,在合作分享(B)和Garbled-Ciruts(G)之间实现任意转换。在以美元为单位的假设情景下,在以美元为单位的常规服务器上,我们设计了一个精细的网络,在以B=美元为单位,我们设计了一个精细的系统, 将一个精细的系统用于学习特定的保密数据操作。