Skip to main navigation Skip to search Skip to main content

Online Reinforcement Learning with Passive Memory

  • University of Illinois at Urbana-Champaign

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide theoretical guarantees for regret that turns out to be near-minimax optimal. Results show that quality of passive memory determines sub-optimality of the incurred regret. The proposed approach and results hold in both continuous and discrete state-action spaces.

Original languageEnglish
Title of host publication2025 American Control Conference, ACC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3551-3557
Number of pages7
ISBN (Electronic)9798331569372
DOIs
StatePublished - 2025
Event2025 American Control Conference, ACC 2025 - Denver, United States
Duration: Jul 8 2025Jul 10 2025

Publication series

NameProceedings of the American Control Conference
ISSN (Print)0743-1619

Conference

Conference2025 American Control Conference, ACC 2025
Country/TerritoryUnited States
CityDenver
Period07/8/2507/10/25

Fingerprint

Dive into the research topics of 'Online Reinforcement Learning with Passive Memory'. Together they form a unique fingerprint.

Cite this