Distributed Multiagent Reinforcement Learning Based on Graph-Induced Local Value Functions
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 69(10), 6636–6651.
author keywords: Couplings; Heuristic algorithms; Convergence; Approximation algorithms; Scalability; Reinforcement learning; Indexes; Distributed learning; Markov decision process; multiagent systems; optimal control; reinforcement learning