Skip to main navigation Skip to search Skip to main content

Differentially Private Federated Temporal Difference Learning

  • Stony Brook University

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

This article considers a federated temporal difference (TD) learning algorithm and provides both asymptotic and finite-time analyses. To protect each worker agent's cost information from being acquired by possible attackers, we propose a privacy-preserving variant of the algorithm by adding perturbation to the exchanged information. We show the rigorous differential privacy guarantee by using moments accountant and derive an upper bound of the utility loss for the privacy-preserving algorithm. Evaluations are also provided to corroborate the efficiency of the algorithms.

Original languageEnglish
Pages (from-to)2714-2726
Number of pages13
JournalIEEE Transactions on Parallel and Distributed Systems
Volume33
Issue number11
DOIs
StatePublished - Nov 1 2022

Keywords

  • Multi-agent reinforcement learning
  • TD learning
  • differential privacy
  • federated learning

Fingerprint

Dive into the research topics of 'Differentially Private Federated Temporal Difference Learning'. Together they form a unique fingerprint.

Cite this