基于强化学习的改进三维A*算法在线航迹规划
任智, 张栋, 唐硕

Improved three-dimensional A* algorithm of real-time path planning based on reinforcement learning
Zhi REN, Dong ZHANG, Shuo TANG
图2 Actor-Critic双网络架构示意图
Fig.2 Schematic diagram of Actor-Critic network