Robust multi-agent cooperative confrontation policy offline reinforcement learning
Huaqing ZHANG (张华卿), Xiaofei ZHANG (张晓飞), Mingrui HAO (郝明瑞), Jixiang JIANG (姜吉祥), Shan LI (李闪)
Systems Engineering and Electronics. 2026, (5): 1670-1681.
DOI: 10.12305/j.issn.1001-506X.2026.05.23