Systems Engineering and Electronics ›› 2022, Vol. 44 ›› Issue (7): 2311-2318.doi: 10.12305/j.issn.1001-506X.2022.07.28

• Communications and Networks • Previous Articles     Next Articles

Network routing optimization approach based on deep reinforcement learning

Lingyu MENG1,2, Bingli GUO1,2,*, Wen YANG1,2, Xinwei ZHANG1,2, Zuoqing ZHAO1,2, Shanguo HUANG1,2   

  1. 1. School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2. State Key Laboratory of Information Photonics and Optical Communication, Beijing 100876, China
  • Received:2021-06-29 Online:2022-06-22 Published:2022-06-28
  • Contact: Bingli GUO

Abstract:

Aiming at the routing optimization problem of different network loads under the same network topology, based on the deep reinforcement learning method, two optimization methods for routing distribution based on the current network traffic state are proposed. Through the iterative interaction between the network simulation system and the deep reinforcement learning model, continuous training and optimization of network routing for the distribution of traffic relationships are realized. Improvements have been made in using the deep deterministec policy gradient (DDPG) algorithm to solve the routing optimization problem, making this optimization method more suitable for solving the problem of network routing optimization. At the same time, a brand-new link weight construction strategy is designed, which uses network traffic to construct input state elements for the neural network. Through the preprocessing of the original data, the learning efficiency of the neural network is strengthened, and the stability of the training model is greatly improved. And for the continuous action space of the high-latitude large-scale network, the action space is discretized, which effectively reduces the complexity of the action space and speeds up the model convergence. Experimental results show that the proposed optimization method can adapt to changing traffic and link status, enhance the stability of model training and improve network performance.

Key words: deep reinforcement learning, routing optimization, deep deterministec policy gradient (DDPG) algorithm

CLC Number: 

[an error occurred while processing this directive]