Journal of Systems Engineering and Electronics ›› 2010, Vol. 32 ›› Issue (5): 1043-1046.doi: 10.3969/j.issn.1001-506X.2010.05.035
Previous Articles Next Articles
ZHAO Yun, CHEN Qing-wei, HU Wei-li
Online:
Published:
Abstract:
To control the balance between exploration and exploitation, a reinforcement learning algorithm based on information entropy is proposed. A new state importance measure is defined from information entropy and is applied to measure the interrelatedness between state and objectives. Based on this new measure, an exploration mechanism is designed for adjusting the balance between exploration and exploitation adaptively. In addition, an autonomic reduction method is obtained by setting the variable threshold of measure, the size of state space can gradually reduce to a small and adapt space, which will save computing resource and accelerate learning speed. Simulation results indicate the good learning performance of the presented reinforcement learning algorithm.
ZHAO Yun, CHEN Qing-wei, HU Wei-li. Reinforcement learning algorithm based on information entropy[J]. Journal of Systems Engineering and Electronics, 2010, 32(5): 1043-1046.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: https://www.sys-ele.com/EN/10.3969/j.issn.1001-506X.2010.05.035
https://www.sys-ele.com/EN/Y2010/V32/I5/1043