1 |
DARPA. Behavior learning for adaptive electronic warfare[EB/OL].[2018-03-23]. http://www.fbo.gov.
|
2 |
DARPA. Communications under extreme RF spectrum conditions[EB/OL].[2018-05-09]. http://www.fbo.gov.
|
3 |
Air Force. Cognitive jammer[EB/OL].[2018-05-09]. http://www.fbo.gov.
|
4 |
DARPA. Adaptive radar countermeasures[EB/OL].[2018-06-26]. https://www.fbo.gov.
|
5 |
孙宏伟, 童宁宁, 孙富君. 基于D-S证据理论的电子干扰模式选择[J]. 弹箭与制导学报, 2003, 23 (2): 218- 220.
|
|
SUN H W , TONG N N , SUN F J . Jamming design selection based on D-S theory[J]. Journal of Projectiles, Rockets, Missiles and Guidance, 2003, 23 (2): 218- 220.
|
6 |
张永顺.复杂电磁环境下基于博弈论的机载雷达对抗仿真研究[D].西安:西安电子科技大学, 2011.
|
|
ZHANG Y S. The research on simulation of airborne radar countermeasures based on game theory in complex electromagnetic environment[D]. Xi'an: Xidian University, 2011.
|
7 |
李云杰, 朱云鹏, 高梅国. 基于Q-学习算法的认知雷达对抗过程设计[J]. 北京理工大学学报, 2015, 35 (11): 1194- 1199.
|
|
LI Y J , ZHU Y P , GAO M G . Design of cognitive radar jamming based on Q-Learning algorithm[J]. Transactions of Beijing Institute of Technology, 2015, 35 (11): 1194- 1199.
|
8 |
邢强, 贾鑫. 基于Q-学习的智能雷达对抗[J]. 系统工程与电子技术, 2018, 40 (5): 1031- 1035.
|
|
XING Q , JIA X . Intelligent radar countermeasure based on Q-learning[J]. Systems Engineering and Electronics, 2018, 40 (5): 1031- 1035.
|
9 |
FARINA A, TIMMONERI L. Live data test of electronic counter-countermeasures (ECCM) on a multifunctional prototype radar[C]//Proc.of the IEEE Metrology for Aerospace, 2016.
|
10 |
马爽.多功能雷达电子情报信号处理关键技术研究[D].长沙:国防科技大学, 2013.
|
|
MA S. Research on ELINT signal processing key technologies for multifunction radar[D]. Changsha: National University of Defense Technology, 2013.
|
11 |
MNIH V , KAVUKCUOGLU K , SILVER D . Human-level control through deep reinforcement learning[J]. Nature, 2015, 518 (7540): 529- 533.
doi: 10.1038/nature14236
|
12 |
DUGGAN M , DUGGAN J , BARRETT E . A reinforcement learning approach for the scheduling of live migration from under utilised hosts[J]. Memetic Computing, 2017, 9 (4): 283- 293.
doi: 10.1007/s12293-016-0218-x
|
13 |
EPPINGER E , WALTER M , SHU C L . Electrophysiological correlates reflect the integration of model-based and model-free decision information[J]. Cognitive, Affective, & Behavioral Neuroscience, 2017, 17 (2): 406- 421.
|
14 |
彭伟. 揭秘深度强化学习[M]. 北京: 北京水利水电出版社, 2018: 39- 61.
|
|
PENG W . Reveal secrets of deep reinforcement learning[M]. Beijing: Beijing Water Resources and Hydropower Press, 2018: 39- 61.
|
15 |
SAJAD H K , SAEED B S , SOROUSH S K . Path planning of modular robots on various terrains using Q-learning versus optimization algorithms[J]. Intelligent Service Robotics, 2017, 10 (2): 121- 136.
doi: 10.1007/s11370-017-0217-x
|
16 |
MATTEO H , JOSEPH M , HAD V H , et al. Rainbow: combining improvements in deep reinforcement learning[J]. Nature, 2017, 1- 9.
|