Abstract
This article considers a federated temporal difference (TD) learning algorithm and provides both asymptotic and finite-time analyses. To protect each worker agent's cost information from being acquired by possible attackers, we propose a privacy-preserving variant of the algorithm by adding perturbation to the exchanged information. We show the rigorous differential privacy guarantee by using moments accountant and derive an upper bound of the utility loss for the privacy-preserving algorithm. Evaluations are also provided to corroborate the efficiency of the algorithms.
| Original language | English |
|---|---|
| Pages (from-to) | 2714-2726 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Parallel and Distributed Systems |
| Volume | 33 |
| Issue number | 11 |
| DOIs | |
| State | Published - Nov 1 2022 |
Keywords
- Multi-agent reinforcement learning
- TD learning
- differential privacy
- federated learning
Fingerprint
Dive into the research topics of 'Differentially Private Federated Temporal Difference Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver