TY - GEN
T1 - DeepPursuit
T2 - 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021
AU - Chen, Ziheng
AU - Zhong, Sichen
AU - Chen, Jianshu
AU - Zhao, Yue
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - In this paper, we formulate sparse signal recovery as a sequential decision making problem (modeled by Markov Decision Processes). Based on the formulation, we propose DeepPursuit, a novel sparse recovery algorithm that learns to recover sparse signals via deep reinforcement learning (RL) and Monte Carlo Tree Search (MCTS). To substantially enhance the learning speed and performance, DeepPursuit (i) employs a novel residual-type policy/value network architecture that organically incorporates the classical wisdom from the Orthogonal Matching Pursuit (OMP) algorithm, and (ii) exploits the available ground-truth knowledge to guide the MCTS during the training process. Experimental results for general random sparse signal recovery demonstrate that, with very low computational complexity, the DeepPursuit algorithm significantly outperforms the state-of-the-art algorithms. Even higher performance gains are observed with experiments on the MNIST dataset.
AB - In this paper, we formulate sparse signal recovery as a sequential decision making problem (modeled by Markov Decision Processes). Based on the formulation, we propose DeepPursuit, a novel sparse recovery algorithm that learns to recover sparse signals via deep reinforcement learning (RL) and Monte Carlo Tree Search (MCTS). To substantially enhance the learning speed and performance, DeepPursuit (i) employs a novel residual-type policy/value network architecture that organically incorporates the classical wisdom from the Orthogonal Matching Pursuit (OMP) algorithm, and (ii) exploits the available ground-truth knowledge to guide the MCTS during the training process. Experimental results for general random sparse signal recovery demonstrate that, with very low computational complexity, the DeepPursuit algorithm significantly outperforms the state-of-the-art algorithms. Even higher performance gains are observed with experiments on the MNIST dataset.
UR - https://www.scopus.com/pages/publications/85127081483
U2 - 10.1109/IEEECONF53345.2021.9723110
DO - 10.1109/IEEECONF53345.2021.9723110
M3 - Conference contribution
AN - SCOPUS:85127081483
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 1361
EP - 1366
BT - 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021
A2 - Matthews, Michael B.
PB - IEEE Computer Society
Y2 - 31 October 2021 through 3 November 2021
ER -