Systems Engineering and Electronics ›› 2025, Vol. 47 ›› Issue (3): 827-841.doi: 10.12305/j.issn.1001-506X.2025.03.15

• Systems Engineering • Previous Articles    

Hierarchical optimization research of constrained vehicle routing based on deep reinforcement learning

Kaiqiang TANG, Huiqiao FU, Jiasheng LIU, Guizhou DENG, Chunlin CHEN   

  1. School of Engineering Management, Nanjing University, Nanjing 210093, China
  • Received:2024-03-07 Online:2025-03-28 Published:2025-04-18
  • Contact: Chunlin CHEN

Abstract:

For the capacitated vehicle routing problem (CVRP), a method is proposed to decouple the capacity constraints using a hierarchical structure, split the complex CVRP into constraint planning and path planning, and perform deep reinforcement learning (DRL) optimisation for solving the problem respectively. Firstly, the upper layer allocates the vehicle distribution tasks based on the attention model and sampling mechanism to plan the set of subpaths that satisfy the constraints. Secondly, the lower layer adopts the pre-trained unconstrained attention model to plan the paths for the set of subpaths. Finally, the network parameters of the upper layer are optimized through the feedback training and iteration of the Reinforce algorithm. Experimental results show that the method generalizes to CVRP and heterogeneous CVRP tasks of different sizes, outperforms the state-of-the-art DRL method. Moreover, compared with other heuristic methods, in batch computing tasks, the solution speed improved by more than 10 times, while maintaining competitive solutions.

Key words: deep reinforcement learning (DRL), vehicle routing problem (VRP), attention model, hiera-rchical optimization

CLC Number: 

[an error occurred while processing this directive]