Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning

Xin Wang; Chen Zhao; Tingwen Huang; Prasun Chakrabarti; Jürgen Kurths

In many specific scenarios, accurate and practical cooperative learning is a commonly encountered challenge in multi-agent systems. The current investigation therefore focuses on cooperative learning algorithms for multi-agent systems and proposes an alternative data-based neural network reinforcement learning framework. To achieve data-based learning optimization, the proposed cooperative learning framework, which comprises two layers, introduces a virtual learning objective. In the first layer, the followers learn the behaviors of the virtual objects using adaptive neural networks (NNs). Specifically, actor and critic NNs are applied to acquire cooperative behaviors and to assess this layer's long-term utility function. The second layer then achieves tracking between the virtual objects and the leader by introducing a local data-based performance index. We then formulate the resulting deterministic optimization problem and solve it effectively with the policy iteration algorithm. This intuitive cooperative learning algorithm also preserves good robustness properties and eliminates the dependence on prior knowledge of the multi-agent system model in the solution process. Finally, a multi-robot formation system demonstrates the practical appeal and effectiveness of the proposed approach.


Cooperative learning of multi-agent systems enables an agent to accomplish objectives by interacting with its neighboring agents. The field has grown remarkably in recent years and is expected to keep growing, given its capability of improving robustness and efficiency [1][2][3]. Cooperative learning plays a significant role in various fields, including intelligent transportation [4], aerospace systems [5], and smart grids [6]. Reinforcement learning, as one of the most practical branches of learning, has attracted substantial attention in the multi-agent cooperative learning community due to its online learning framework and simplicity of implementation [7][8][9]. Reinforcement learning concerns how agents should select actions in an environment so that some notion of cumulative reward is maximized, where the environment can be formulated as a Markov decision process.
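To make the notions of cumulative reward, Markov decision process, and policy iteration concrete, the following is a minimal tabular sketch. It is not the data-based neural network method of the paper: the two-state MDP, its transition table `P`, the discount factor `GAMMA`, and all function names are hypothetical and chosen only for illustration.

```python
# Tabular policy iteration on a hypothetical two-state, two-action MDP.
# Illustrative only; this is NOT the paper's actor-critic NN framework.

# P[s][a] = list of (probability, next_state, reward) triples (assumed values).
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 1.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}
GAMMA = 0.9  # discount factor for the cumulative reward (assumed)


def q_value(P, V, s, a, gamma):
    """Expected discounted return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])


def policy_evaluation(P, policy, gamma, tol=1e-10):
    """Iteratively compute the value function of a fixed deterministic policy."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            v_new = q_value(P, V, s, policy[s], gamma)
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


def policy_improvement(P, V, gamma):
    """Return the policy that is greedy with respect to V."""
    return {s: max(P[s], key=lambda a: q_value(P, V, s, a, gamma)) for s in P}


def policy_iteration(P, gamma):
    """Alternate evaluation and improvement until the policy stops changing."""
    policy = {s: min(P[s]) for s in P}  # arbitrary initial policy
    while True:
        V = policy_evaluation(P, policy, gamma)
        new_policy = policy_improvement(P, V, gamma)
        if new_policy == policy:
            return policy, V
        policy = new_policy


policy, V = policy_iteration(P, GAMMA)
print(policy, V)
```

In this toy problem the greedy policy settles on action 1 in both states, since staying in state 1 yields the largest per-step reward. The paper's setting differs in that the system model is unknown and the value function and policy are approximated by critic and actor NNs rather than stored in tables.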
