An Efficient Parallel Secure Machine Learning Framework on GPUs
This is the source code of our TPDS paper entitled "An Efficient Parallel Secure Machine Learning Framework on GPUs" by Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du, 2021.
We develop a parallel secure machine learning framework on GPUS using secure multi-party computing.
Machine learning has been widely used in our daily lives. Large amounts of data have been continuously produced and transmitted to the cloud for model training and data processing, which raises a problem: how to preserve the security of the data. Recently, a secure machine learning system named SecureML has been proposed to solve this issue using two-party computation. However, due to the excessive computation expenses of two-party computation, the secure machine learning is about 2x slower than the original machine learning methods. Previous work on secure machine learning mostly focused on novel protocols or improving accuracy, while the performance metric has been ignored. In this paper, we propose a GPU-based framework ParSecureML to improve the performance of secure machine learning algorithms based on two-party computation. The main challenges of developing ParSecureML lie in the complex computation patterns, frequent intra-node data transmission between CPU and GPU, and complicated inter-node data dependence. To handle these challenges, we propose a series of novel solutions, including profiling-guided adaptive GPU utilization, finegrained double pipeline for intra-node CPU-GPU cooperation, and compressed transmission for inter-node communication. As far as we know, this is the first GPU-based secure machine learning framework. Compared to the state-of-the-art framework, ParSecureML achieves an average of 33.8x speedup.
1.Set CUDA path and MPI path in Makefile.
2.Complie all example program
make
You can also only complie an example program, option:
make TestLinear
make TestLogistic
make TestMLP
make TestConv
make TestSVM
make TestRNN
1.cd bin
2.If the slurm system is installed in your environment,
cd slurmrun
If not, you can use mpirun
to run the program,
cd mpirun
3.Run all example program:
bash run.sh
If you just want to run a example,
bash linear.sh
bash logistic.sh
bash mlp.sh
bash conv.sh
bash svm.sh
bash rnn.sh
You can also use other appropriate methods to run. Our program needs three nodes to run.
ParSecureML is developed by Renmin University of China, Shenzhen University, Tsinghua University.
Zheng Chen, Feng Zhang, Chenyang Zhang and Xiaoyong Du are with the Key Laboratory of Data Engineering and Knowledge Engineering (MOE), and School of Information, Renmin University of China.
Amelie Chi Zhou is with the Guangdong Province Engineering Center of China-made High Performance Data Computing System, Shenzhen University.
Jidong Zhai is with the Department of Computer Science and Technology, Tsinghua University, BNRist
If you have any questions, please contact us ([email protected])
If you use our code, please cite our paper:
@article{Zhang2021parsecureml,
title={{An Efficient Parallel Secure Machine Learning Framework on GPUs}},
author={Feng Zhang, Zheng Chen, Chenyang Zhang, Amelie Chi Zhou, Jidong Zhai, Xiaoyong Du},
booktitle={IEEE Transactions on Parallel and Distributed Systems},
year={2021}
}
@inproceedings{chen2020parsecureml,
title={ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs},
author={Chen, Zheng and Zhang, Feng and Zhou, Amelie Chi and Zhai, Jidong and Zhang, Chenyang and Du, Xiaoyong},
booktitle={49th International Conference on Parallel Processing-ICPP},
pages={1--11},
year={2020}
}