
# Differential Privacy for Federated Learning with Secure Aggregation

The goal of this project is to build a system for federated machine learning in which the differential privacy of every individual client's data can be guaranteed, using secure aggregation to provide global differential privacy.

See our paper for an approachable summary and details!

## Motivation

Machine learning aims to automatically improve a model given sample data. Many problems in the field are formulated as the search for a parameter set for the model that is optimal w.r.t. some objective function. If the model is suitable (as the ubiquitous neural network models are), variants of stochastic gradient descent (SGD) are frequently used to solve this optimization problem.

In a nutshell, SGD computes the gradient of the objective function, composed with the model, w.r.t. the current parameter set at all points of a random subset of the training data. The parameters are then updated in the opposite direction of the gradient sum. Intuitively, this moves the parameters in approximately the direction of steepest descent, hence hopefully towards a local minimum, of the objective function. This process is iterated until performance is satisfactory.
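To make the update rule concrete, here is a minimal, hedged sketch of one SGD step; the names (`params`, `grad_fn`, `lr`) are illustrative and not part of this project's code.

```python
# Minimal illustration of one SGD step; `grad_fn` and the other names are
# hypothetical and only serve to make the description above concrete.
import numpy as np

def sgd_step(params, data, grad_fn, batch_size=32, lr=0.1):
    """Perform one stochastic gradient descent step.

    `grad_fn(params, batch)` is assumed to return the gradient of the
    objective function w.r.t. `params`, summed over the batch.
    """
    # Draw a random subset (mini-batch) of the training data.
    idx = np.random.choice(len(data), size=batch_size, replace=False)
    batch = data[idx]
    # Step against the gradient, i.e. towards lower values of the objective.
    return params - lr * grad_fn(params, batch)
```

In the federated setting described below, the gradient computation and the summation are split between the clients and the aggregation servers.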

Two privacy issues that arise when training on sensitive data are considered here:

- The party executing the SGD computation has access to all data.
- The model resulting from the training may be vulnerable to inference attacks, even for an adversary that only has query access.

## Approach

We suggest the following solution:

- Federated learning with secure aggregation. The gradients are computed locally by the data owners (referred to as "clients" hereafter). The gradient sum is computed by servers following a secure aggregation protocol in which no one but the client itself ever sees its data in plaintext. The model update is then performed publicly and used for the next iteration.

- Differential privacy (DP). A calibrated amount of noise is added to the gradients in each update step, trading model performance for protection of client privacy.

  - The straightforward way of doing federated private learning is local DP: each client adds the noise locally before transmitting to the aggregation servers. This is great because it does not require trust in the aggregators. It has downsides, though: the total noise is much larger than strictly necessary, as each client has to add the total amount of noise in case another client maliciously refrains from adding any noise. This hurts model performance.
  - Global DP to the rescue! The aggregation servers add the noise after computing the gradient sum. A problem that arises in the federated setting is that the noise is calibrated under the assumption that the L2 norm of each gradient is bounded by 1, but the servers only ever see the gradients in ciphertext and cannot check this. Luckily, the VDAF protocols allow the aggregators to compute certain verification functions on client submissions without knowing the plaintext. This can be leveraged to enforce the norm bound server-side (see the sketch below).
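The sketch below illustrates the two ingredients the global-DP variant relies on: clipping each gradient to L2 norm at most 1 and adding Gaussian noise once to the aggregate. It is only a plaintext illustration with assumed names; in the actual system the aggregation happens on secret shares, and the noise scale `sigma` must be calibrated to the desired (epsilon, delta) guarantee, which is not shown here.

```python
# Plaintext illustration only: in the real protocol the aggregators work on
# secret shares and never see these vectors. `sigma` is assumed to be
# calibrated to the desired (epsilon, delta) guarantee elsewhere.
import numpy as np

def clip_gradient(grad, bound=1.0):
    """Scale `grad` so that its L2 norm is at most `bound` (client side)."""
    norm = np.linalg.norm(grad)
    return grad if norm <= bound else grad * (bound / norm)

def noisy_sum(clipped_grads, sigma):
    """Sum the clipped gradients and add Gaussian noise once (global DP)."""
    total = np.sum(clipped_grads, axis=0)
    return total + np.random.normal(0.0, sigma, size=total.shape)
```

Under local DP, by contrast, every client would have to add noise of roughly this magnitude itself, so the noise in the sum grows with the number of clients instead of staying constant.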

## Setup

*(Figure: system overview)*

## Threat model

The following entities participate:

  1. The clients hold sensitive data on which the machine learning task is supposed to be executed.
  2. The aggregation servers perform gradient aggregation without seeing plaintext submissions.
  3. The ML server holds the current model, updates it according to the gradient aggregates, and distributes the updated model.

We aim to provide the following privacy guarantees:

- Anonymity (no adversary can tell which client submitted which data value) and privacy (no adversary learns anything about an honest client's data values except the aggregate) can be guaranteed if
  - all clients are malicious,
  - at least one aggregation server is honest but curious and the remaining ones are malicious, and
  - the ML server is malicious.
- Differential privacy can be guaranteed if
  - all clients are malicious,
  - all aggregation servers are honest but curious, and
  - the ML server is malicious.

NOTE: The reason for requiring all aggregation servers to be honest is that prio only guarantees robustness in such a case (see, e.g., the prio paper). Robustness is required to know that all submitted vectors are clipped as intended.

Correctness of the learning result requires honesty of all participants. Since differential privacy persists even for malicious clients, the learning result is at least guaranteed to be robust against data poisoning in that case.

## How it works

  1. The ML server distributes its current model to the clients.
  2. Each client locally computes the gradient vector for that model based on its data.
  3. Each client splits its gradient vector into gradient shares and submits a share to each aggregation server.
  4. The aggregation servers verify that the submitted vectors are well-formed (clipped, with L2 norm at most 1). This is done in a distributed way, without the servers learning anything about the values of the clients' submissions.
  5. Each aggregation server adds noise to the clients' shares to provide pre-established privacy guarantees.
  6. The aggregation servers compute the aggregate gradient as a sum of all client gradients, again in a distributed fashion. The aggregate contains noise from all the aggregation servers and is sent to the ML server.
  7. The ML server updates its model and can initiate a new training round.
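The following plaintext mock-up walks through one such round end to end. It is a sketch only: all names (`clients`, `local_gradient`, `sigma`) are hypothetical, share splitting and distributed verification are replaced by plain Python, and in the real system this data flow runs through the janus aggregators.

```python
# Plaintext mock-up of steps 1-7 above. In the real system steps 3-6 operate
# on secret shares inside janus/prio3; here everything is in the clear purely
# to illustrate the data flow. All names are hypothetical.
import numpy as np

def training_round(model_params, clients, num_aggregators, sigma, lr=0.1):
    # Steps 1-3: the model is distributed, and each client computes its local
    # gradient and clips it to L2 norm at most 1 (share splitting omitted).
    submissions = []
    for client in clients:
        g = client.local_gradient(model_params)
        g = g / max(1.0, np.linalg.norm(g))
        submissions.append(g)

    # Step 4: the aggregators verify the norm bound; here we just assert it.
    assert all(np.linalg.norm(g) <= 1.0 + 1e-9 for g in submissions)

    # Steps 5-6: each aggregation server contributes noise, so the aggregate
    # carries the combined noise of all servers.
    aggregate = np.sum(submissions, axis=0)
    for _ in range(num_aggregators):
        aggregate += np.random.normal(0.0, sigma, size=aggregate.shape)

    # Step 7: the ML server updates its model with the noisy aggregate.
    return model_params - lr * aggregate
```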

## Roadmap

## Implementation

This project involves several repositories; the dependencies between them are as follows.

Please note that libprio-rs and janus are not part of this project; we merely integrate some changes required for our use case upstream.

- libprio-rs: we define a prio3 type for securely aggregating gradient vectors from clients. This code is integrated upstream in the libprio-rs repository.
- janus: we add the necessary plumbing code for our new type to janus.
- dpsa4fl depends on janus and contains the core of our project: the code necessary to interact with janus servers specifically in the setting of federated learning.
- dpsa4fl-bindings.py is a wrapper around dpsa4fl and is released as a Python package that can be downloaded from PyPI.
- dpsa4fl-infrastructure contains setup instructions for local and distributed deployment of the janus infrastructure required for our project.
- dpsa4flower implements a client and a server for using the flower framework for federation.
- dpsa4fl-example-project is a fully working example of how to use dpsa4fl (via our Python bindings) with the flower framework for differentially private federated learning.
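For orientation, the sketch below shows where these pieces would sit in a standard flower setup. It is purely hypothetical: `LocalTrainingClient` and the commented-out strategy are placeholder names, not the actual dpsa4flower or dpsa4fl-bindings API; refer to dpsa4fl-example-project for working code.

```python
# Hypothetical orientation sketch only. The placeholder names below are NOT
# the actual dpsa4flower / dpsa4fl-bindings API; see dpsa4fl-example-project
# for a working example. Only the plain flwr calls are real flower API.
import flwr as fl

class LocalTrainingClient(fl.client.NumPyClient):
    """Placeholder flower client; dpsa4flower provides the real one, which
    submits clipped gradient shares to the janus aggregators."""

    def get_parameters(self, config):
        ...  # return the current local model parameters

    def fit(self, parameters, config):
        ...  # compute the local (clipped) update on private data

# ML-server side (placeholder): a dpsa4flower strategy that delegates the
# aggregation to the janus servers would be passed to the flower server.
# fl.server.start_server(
#     server_address="0.0.0.0:8080",
#     config=fl.server.ServerConfig(num_rounds=10),
#     strategy=...,  # dpsa4flower server-side strategy would go here
# )

# Client side (placeholder):
# fl.client.start_numpy_client(server_address="127.0.0.1:8080",
#                              client=LocalTrainingClient())
```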

## Funding

This project is funded through the NGI Assure Fund, a fund established by NLnet with financial support from the European Commission's Next Generation Internet program. Learn more on the NLnet project page.
