Journal of Systems Engineering and Electronics ›› 2010, Vol. 32 ›› Issue (5): 1043-1046.doi: 10.3969/j.issn.1001-506X.2010.05.035

Previous Articles     Next Articles

Reinforcement learning algorithm based on information entropy

ZHAO Yun,  CHEN Qing-wei,  HU Wei-li   

  1. (School of Automation, Nanjing Univ. of Science and Technology, Nanjing 210094, China)
  • Online:2010-05-24 Published:2010-01-03

Abstract:

To control the balance between exploration and exploitation, a reinforcement learning algorithm based on information entropy is proposed. A new state importance measure is defined from information entropy and is applied to measure the interrelatedness between state and objectives. Based on this new measure, an exploration mechanism is designed for adjusting the balance between exploration and exploitation adaptively. In addition, an autonomic reduction method is obtained by setting the variable threshold of measure, the size of state space can gradually reduce to a small and adapt space, which will save computing resource and accelerate learning speed. Simulation results indicate the good learning performance of the presented reinforcement learning algorithm.

[an error occurred while processing this directive]