A New and Effective Reinforcement Learning based on Tentative Q learning and Knowledge Transfer

WCSE 2018 ISBN: 978-981-11-7861-0
DOI: 10.18178/wcse.2018.06.049

Duan Jun-Hua, Zhu Yi-An, Zhong Dong, Zhan Tao, Luo Shuyan

Abstract— To address the slow learning speed of reinforcement learning, this paper proposes the TQL-RWKT-RRL algorithm, which combines tentative Q-learning with knowledge transfer. Tentative Q-learning increases the number of exploration attempts in each iteration and improves the update rule for the Q-value function. The knowledge transfer algorithm uses rolling windows to transfer knowledge between different state spaces. Path-planning experience gained in a small, simple environment is transferred to a larger, more complex state space, which speeds up robot path-planning learning in the larger, more complex environment.
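
For orientation, the baseline that tentative Q-learning modifies can be sketched as the standard one-step tabular Q-learning update. The snippet below is only an illustrative sketch of that textbook baseline, not the TQL-RWKT-RRL algorithm itself, whose tentative exploration and modified update rule are not specified in this abstract. The function names, the dictionary-based Q table, and the values of `alpha`, `gamma`, and `epsilon` are all assumptions chosen for illustration.

```python
import random


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Standard one-step tabular Q-learning update (textbook baseline,
    NOT the paper's tentative variant). Unseen entries default to 0.0."""
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    # Move the old estimate toward the bootstrapped target.
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


def epsilon_greedy(q, state, actions, epsilon=0.1, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q.get((state, a), 0.0))


# Hypothetical grid-world usage: states are (row, col) cells.
ACTIONS = ["up", "down", "left", "right"]
q_table = {}
q_update(q_table, (0, 0), "right", 1.0, (0, 1), ACTIONS)
```

In this baseline the agent takes one exploratory step per update; according to the abstract, the tentative variant performs more exploration per iteration and changes how the Q value is updated.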

Index Terms— reinforcement learning, knowledge transfer, rolling windows, robot path-planning

Duan Jun-Hua, Zhu Yi-An, Zhong Dong, Zhan Tao, Luo Shuyan
School of Computer, Northwestern Polytechnical University, CHINA

Cite: Duan Jun-Hua, Zhu Yi-An, Zhong Dong, Zhan Tao, Luo Shuyan, "A New and Effective Reinforcement Learning based on Tentative Q learning and Knowledge Transfer," Proceedings of 2018 the 8th International Workshop on Computer Science and Engineering, pp. 276-281, Bangkok, 28-30 June, 2018.
