TY - JOUR
T1 - An Efficient Parallel Secure Machine Learning Framework on GPUs
AU - Zhang, Feng
AU - Chen, Zheng
AU - Zhang, Chenyang
AU - Zhou, Amelie Chi
AU - Zhai, Jidong
AU - Du, Xiaoyong
N1 - Funding information:
This work was supported by the National R&D Program of China under Grant 2020AAA0105200, in part by the National Natural Science Foundation of China under Grant U20A20226, Grant 61802412, Grant 61802260, Grant 61972403, and Grant 61732014, in part by the Beijing Natural Science Foundation under Grant 4202031 and Grant L192027, in part by the Beijing Academy of Artificial Intelligence (BAAI), and in part by the Tsinghua University-Peking Union Medical College Hospital Initiative Scientific Research Program. The work of Amelie Chi Zhou was also supported by the Shenzhen Science and Technology Foundation under Grant JCYJ20180305125737520 and a Tencent “Rhinoceros Birds” project of Scientific Research Foundation for Young Teachers of Shenzhen University.
Publisher Copyright:
© 2021 IEEE.
PY - 2021/9/1
Y1 - 2021/9/1
N2 - Machine learning is widely used in our daily lives. Large amounts of data have been continuously produced and transmitted to the cloud for model training and data processing, which raises a problem: how to preserve the security of the data. Recently, a secure machine learning system named SecureML has been proposed to solve this issue using two-party computation. However, due to the excessive computation expenses of two-party computation, the secure machine learning is about 2× slower than the original machine learning methods. Previous work on secure machine learning mostly focused on novel protocols or improving accuracy, while the performance metric has been ignored. In this article, we propose a GPU-based framework ParSecureML to improve the performance of secure machine learning algorithms based on two-party computation. The main challenges of developing ParSecureML lie in the complex computation patterns, frequent intra-node data transmission between CPU and GPU, and complicated inter-node data dependence. To handle these challenges, we propose a series of novel solutions, including profiling-guided adaptive GPU utilization, fine-grained double pipeline for intra-node CPU-GPU cooperation, and compressed transmission for inter-node communication. Moreover, we integrate architecture specific optimizations, such as Tensor Cores, into ParSecureML. As far as we know, this is the first GPU-based secure machine learning framework. Compared to the state-of-the-art framework, ParSecureML achieves an average of 33.8× speedup. ParSecureML can also be applied to inferences, which achieves 31.7× speedup on average.
AB - Machine learning is widely used in our daily lives. Large amounts of data have been continuously produced and transmitted to the cloud for model training and data processing, which raises a problem: how to preserve the security of the data. Recently, a secure machine learning system named SecureML has been proposed to solve this issue using two-party computation. However, due to the excessive computation expenses of two-party computation, the secure machine learning is about 2× slower than the original machine learning methods. Previous work on secure machine learning mostly focused on novel protocols or improving accuracy, while the performance metric has been ignored. In this article, we propose a GPU-based framework ParSecureML to improve the performance of secure machine learning algorithms based on two-party computation. The main challenges of developing ParSecureML lie in the complex computation patterns, frequent intra-node data transmission between CPU and GPU, and complicated inter-node data dependence. To handle these challenges, we propose a series of novel solutions, including profiling-guided adaptive GPU utilization, fine-grained double pipeline for intra-node CPU-GPU cooperation, and compressed transmission for inter-node communication. Moreover, we integrate architecture specific optimizations, such as Tensor Cores, into ParSecureML. As far as we know, this is the first GPU-based secure machine learning framework. Compared to the state-of-the-art framework, ParSecureML achieves an average of 33.8× speedup. ParSecureML can also be applied to inferences, which achieves 31.7× speedup on average.
KW - GPU acceleration
KW - machine learning
KW - secure inference
KW - secure training
KW - Two-party computation
UR - https://www.scopus.com/pages/publications/85100865669
U2 - 10.1109/TPDS.2021.3059108
DO - 10.1109/TPDS.2021.3059108
M3 - Journal article
AN - SCOPUS:85100865669
SN - 1045-9219
VL - 32
SP - 2262
EP - 2276
JO - IEEE Transactions on Parallel and Distributed Systems
JF - IEEE Transactions on Parallel and Distributed Systems
IS - 9
M1 - 9354058
ER -