Federated Learning for GDPR Compliance

Overview

Federated learning (FL) is a distributed machine learning approach that trains models across multiple data holders without centralising personal data. Instead of collecting training data into a central repository, federated learning sends the model to the data, computes local updates on each participant's device or server, and aggregates only model updates (gradients or weights) at a central coordinator. This architecture directly addresses GDPR data minimisation (Art. 5(1)(c)) and data protection by design (Art. 25) principles by eliminating the need to transfer and centralise personal data for AI training. However, federated learning is not a privacy silver bullet — it introduces its own privacy risks that must be managed through complementary techniques.

Federated Learning Architecture Patterns

Pattern 1: Cross-Device Federated Learning

Use case: Training on data from millions of user devices (smartphones, tablets, IoT).

Component	Description
Participants	End-user devices (smartphones, tablets, wearables)
Scale	Thousands to millions of participants
Data	Small per-device, large aggregate (e.g., keyboard predictions, health metrics)
Coordination	Central server selects participants per round, distributes model, aggregates updates
Communication	Compressed gradient updates over mobile networks

ai-federated-learning

Federated Learning for GDPR Compliance

Overview

Federated Learning Architecture Patterns

Pattern 1: Cross-Device Federated Learning

More from mukul975/privacy-data-protection-skills

thailand-pdpa

privacy-record-linkage

ai-dpia

ai-transparency-reqs

apec-cbpr-cert

42-cfr-part-2