TY - GEN
T1 - Online Reinforcement Learning with Passive Memory
AU - Pattanaik, Anay
AU - Varshney, Lav R.
N1 - Publisher Copyright:
© 2025 AACC.
PY - 2025
Y1 - 2025
N2 - This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide theoretical guarantees for regret that turns out to be near-minimax optimal. Results show that quality of passive memory determines sub-optimality of the incurred regret. The proposed approach and results hold in both continuous and discrete state-action spaces.
AB - This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide theoretical guarantees for regret that turns out to be near-minimax optimal. Results show that quality of passive memory determines sub-optimality of the incurred regret. The proposed approach and results hold in both continuous and discrete state-action spaces.
UR - https://www.scopus.com/pages/publications/105015665694
U2 - 10.23919/ACC63710.2025.11107904
DO - 10.23919/ACC63710.2025.11107904
M3 - Conference contribution
AN - SCOPUS:105015665694
T3 - Proceedings of the American Control Conference
SP - 3551
EP - 3557
BT - 2025 American Control Conference, ACC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 American Control Conference, ACC 2025
Y2 - 8 July 2025 through 10 July 2025
ER -