一种基于信息熵的强化学习算法

doi:10.3969/j.issn.1001-506X.2010.05.035

Journal of Systems Engineering and Electronics ›› 2010, Vol. 32 ›› Issue (5): 1043-1046.doi: 10.3969/j.issn.1001-506X.2010.05.035

Previous Articles Next Articles

Reinforcement learning algorithm based on information entropy

ZHAO Yun, CHEN Qing-wei, HU Wei-li

(School of Automation, Nanjing Univ. of Science and Technology, Nanjing 210094, China)

Online:2010-05-24 Published:2010-01-03

Abstract

Abstract:

To control the balance between exploration and exploitation, a reinforcement learning algorithm based on information entropy is proposed. A new state importance measure is defined from information entropy and is applied to measure the interrelatedness between state and objectives. Based on this new measure, an exploration mechanism is designed for adjusting the balance between exploration and exploitation adaptively. In addition, an autonomic reduction method is obtained by setting the variable threshold of measure, the size of state space can gradually reduce to a small and adapt space, which will save computing resource and accelerate learning speed. Simulation results indicate the good learning performance of the presented reinforcement learning algorithm.

ZHAO Yun, CHEN Qing-wei, HU Wei-li. Reinforcement learning algorithm based on information entropy[J]. Journal of Systems Engineering and Electronics, 2010, 32(5): 1043-1046.

Reinforcement learning algorithm based on information entropy

PDF (PC)

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 0

Recommended Articles

Metrics

Comments