xaynet
Xaynet: Train on the Edge with Federated Learning
Want a framework that supports federated learning on the edge, in desktop browsers, integrates well with mobile apps, is performant, and preserves privacy? Welcome to XayNet, written entirely in Rust!
Making federated learning easy for developers
Frameworks for machine learning - including those expressly for federated learning - exist already. These frameworks typically facilitate federated learning of cross-silo use cases - for example in collaborative learning across a limited number of hospitals or for instance across multiple banks working on a common use case without the need to share valuable and sensitive data.
This repository focusses on masked cross-device federated learning to enable the orchestration of machine learning in millions of low-power edge devices, such as smartphones or even cars. By doing this, we hope to also increase the pace and scope of adoption of federated learning in practice and especially allow the protection of end user data. All data remains in private local premises, whereby only encrypted AI models get automatically and asynchronously aggregated. Thus, we provide a solution to the AI privacy dilemma and bridge the often-existing gap between privacy and convenience. Imagine, for example, a voice assistant to learn new words directly on device level and sharing this knowledge with all other instances, without recording and collecting your voice input centrally. Or, think about search engine that learns to personalise search results without collecting your often sensitive search queries centrally… There are thousands of such use cases that right today still trade privacy for convenience. We think this shouldn’t be the case and we want to provide an alternative to overcome this dilemma.
Concretely, we provide developers with:
- App dev tools: An SDK to integrate federated learning into apps written in Dart or other languages of choice for mobile development, as well as frameworks like Flutter.
- Privacy via cross-device federated learning: Train your AI models locally on edge devices such as mobile phones, browsers, or even in cars. Federated learning automatically aggregates the local models into a global model. Thus, all insights inherent in the local models are captured, while the user data stays private on end devices.
- Security Privacy via homomorphic encryption: Aggregate models with the highest security and trust. Xayn’s masking protocol encrypts all models homomorphically. This enables you to aggregate encrypted local models into a global one – without having to decrypt local models at all. This protects private and even the most sensitive data.
The case for writing this framework in Rust
Our framework for federated learning is not only a framework for machine learning as such. Rather, it supports the federation of machine learning that takes place on possibly heterogeneous devices and where use cases involve many such devices.
The programming language in which this framework is written should therefore give us strong support for the following:
- Runs "everywhere": the language should not require its own runtime and code should compile on a wide range of devices.
- Memory and concurrency safety: code that compiles should be both memory safe and free of data races.
- Secure communication: state of the art cryptography should be available in vetted implementations.
- Asynchronous communication: abstractions for asynchronous communication should exist that make federated learning scale.
- Fast and functional: the language should offer functional abstractions but also compile code into fast executables.
Rust is one of the very few choices of modern programming languages that meets these requirements:
- its concepts of Ownership and Borrowing make it both memory and thread-safe (hence avoiding many common concurrency issues).
- it has a strong and static type discipline and traits, which describe shareable functionality of a type.
- it is a modern systems programming language, with some functional style features such as pattern matching, closures and iterators.
- its idiomatic code compares favourably to idiomatic C in performance.
- it compiles to WASM and can therefore be applied natively in browser settings.
- it is widely deployable and doesn't necessarily depend on a runtime, unlike languages such as Java and their need for a virtual machine to run its code. Foreign Function Interfaces support calls from other languages/frameworks, including Dart, Python and Flutter.
- it compiles into LLVM, and so it can draw from the abundant tool suites for LLVM.
Getting Started
Minimum supported rust version
rustc 1.48.0
Running the platform
There are a few different ways to run the backend: via docker, or by deploying it to a Kubernetes cluster or by compiling the code and running the binary manually.
- Everything described below assumes your shell's working directory to be the root of the repository.
- The following instructions assume you have pre-existing knowledge on some of the referenced software (like
docker
anddocker-compose
) and/or a working setup (if you decide to compile the Rust code and run the binary manually). - In case you need help with setting up your system accordingly, we recommend you refer to the official documentation of each tool, as supporting them here would be beyond the scope of this project:
Note:
With Xaynet v0.11
the coordinator needs a connection to a redis instance in order to save its state.
Don't connect the coordinator to a Redis instance that is used in production!
We recommend connecting the coordinator to its own Redis instance. We have invested a lot of time to make sure that the coordinator only deletes its own data but in the current state of development, we cannot guarantee that this will always be the case.
Using Docker
The convenience of using the docker setup is that there's no need to setup a working Rust environment on your system, as everything is done inside the container.
Run an image from Docker Hub
Docker images of the latest releases are provided on Docker Hub.
You can try them out with the default configs/docker-dev.toml
by running:
Xaynet below v0.11
docker run -v ${PWD}/configs/docker-dev.toml:/app/config.toml -p 8081:8081 xaynetwork/xaynet:v0.10.0 /app/coordinator -c /app/config.toml
Xaynet v0.11+
# don't forget to adjust the Redis url in configs/docker-dev.toml
docker run -v ${PWD}/configs/docker-dev.toml:/app/config.toml -p 8081:8081 xaynetwork/xaynet:v0.11.0
The docker image contains a release build of the coordinator without optional features.
Run a coordinator with additional infrastructure
Start the coordinator by pointing to the docker/docker-compose.yml
file. It spins up all infrastructure that is essential to run the coordinator with default or optional features. Keep in mind that this file is used for development only.
docker-compose -f docker/docker-compose.yml up --build
Create a release build
If you would like, you can create an optimized release build of the coordinator, but keep in mind that the compilation will be slower.
docker build --build-arg RELEASE_BUILD=1 -f ./docker/Dockerfile .
Build a coordinator with optional features
Optional features can be specified via the build argument COORDINATOR_FEATURES
.
docker build --build-arg COORDINATOR_FEATURES=tls,metrics -f ./docker/Dockerfile .
Using Kubernetes
To deploy an instance of the coordinator to your Kubernetes cluster, use the manifests that are located inside the k8s/coordinator
folder. The manifests rely on kustomize
to be generated (kustomize
is officially supported by kubectl
since v1.14). We recommend you thoroughly go through the manifests and adjust them according to your own setup (namespace, ingress, etc.).
Remember to also check (and adjust if necessary) the default configuration for the coordinator, available at k8s/coordinator/development/config.toml
.
Please adjust the domain used in the k8s/coordinator/development/ingress.yaml
file so it matches your needs (you can also skip ingress
altogether, just make sure you remove its reference from k8s/coordinator/development/kustomization.yaml
).
Keep in mind that the ingress
configuration that is shown on k8s/coordinator/development/ingress.yaml
relies on resources that aren't available in this repository, due to their sensitive nature (TLS key and certificate, for instance).
To verify the generated manifests, run:
kubectl kustomize k8s/coordinator/development
To apply them:
kubectl apply -k k8s/coordinator/development
In case you are not exposing your coordinator via ingress
, you can still reach it using a port-forward. The example below creates a port-forward at port 8081
assuming the coordinator pod is still using the app=coordinator
label:
kubectl port-forward $(kubectl get pods -l "app=coordinator" -o jsonpath="{.items[0].metadata.name}") 8081
Building the project manually
The coordinator without optional features can be built and started with:
cd rust
cargo run --bin coordinator -- -c ../configs/config.toml
Running the example
The example can be found under rust/examples/. It uses a dummy model but is network-capable, so it's a good starting point for checking connectivity with the coordinator.
test-drive
Make sure you have a running instance of the coordinator and that the clients you will spawn with the command below are able to reach it through the network.
Here is an example on how to start 20
participants that will connect to a coordinator running on 127.0.0.1:8081
:
cd rust
RUST_LOG=info cargo run --example test-drive -- -n 20 -u http://127.0.0.1:8081
For more in-depth details on how to run examples, see the accompanying Getting Started guide under rust/xaynet-server/src/examples.rs.
Troubleshooting
If you have any difficulties running the project, please reach out to us by opening an issue and describing your setup and the problems you're facing.