Systems Engineering and Electronics ›› 2021, Vol. 43 ›› Issue (11): 3338-3351.doi: 10.12305/j.issn.1001-506X.2021.11.35

• Communications and Networks • Previous Articles     Next Articles

Review of multi-agent reinforcement learning based dynamic spectrum allocation method

Bo SONG1,2,*, Wei YE1, Xianghui MENG2   

  1. 1. Department of Electronic and Optical Engineering, Space Engineering University, Beijing 101416, China
    2. Unit 95801 of the PLA, Beijing 100076, China
  • Received:2021-01-13 Online:2021-11-01 Published:2021-11-12
  • Contact: Bo SONG

Abstract:

Cognitive radio and dynamic spectrum allocation technology are effective means to solve the scarcity of spectrum. With the rapid development of machine learning technology including deep learning and reinforcement learning in recent years, the swarm intelligence technology represented by multi-agent reinforcement learning is continuously making breakthroughs, which is also making distributed and intelligent dynamic spectrum allocation possible. This paper reviews the key research achievements in reinforcement learning and multi-agent reinforcement learning in detail, as well as research in modeling methods and algorithms of dynamic spectrum allocation process based on multi-agent reinforcement learning. The method could boil down to four types: independent Q-learning, cooperating Q-learning, joint Q-learning and multi-agent actor-critic. The advantages and disadvantages of the existing four types of methods are analyzed, and the critical problems and possible solutions of the dynamic spectrum allocation method based on multi-agent reinforcement learning are summarized.

Key words: spectrum management, cognitive radio, dynamic spectrum allocation, machine learning, reinforcement learning, multi-agent reinforcement learning

CLC Number: 

[an error occurred while processing this directive]