鲁棒多智能体协同对抗策略离线强化学习
张华卿, 张晓飞, 郝明瑞, 姜吉祥, 李闪
Robust multi-agent cooperative confrontation policy offline reinforcement learning
Huaqing ZHANG, Xiaofei ZHANG, Mingrui HAO, Jixiang JIANG, Shan LI
Systems Engineering and Electronics (系统工程与电子技术). 2026, (5): 1670-1681. DOI: 10.12305/j.issn.1001-506X.2026.05.23