loading...
Reinforcement Learning with Inertial Exploration
2007 IEEE/WIC/ACM International Confe ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
In the Q-Learning framework, the exploration of large environment is influenced by the time credit assignment problem. In this context, abstraction techniques may be used. Thus, multi-step actions (MSA) Q-Learning has been proposed to take advantage of the fact that few action switches are usually required in optimal policies. In this article, we propose the concept of inertial exploration, we apply a log-selection of the scales to MSA Q-Learning and we go further by proposing a dynamic time scale approach. We demonstrate that the same improvement in learning speed can be achieved without the full scales set. This improvement is shown on the mountain car problem and on a more realistic application of vehicle control.
Citation:
Dany Bergeron, Charles Desjardins, Julien Laumonier, Brahim Chaib-draa, "Reinforcement Learning with Inertial Exploration," iat,pp.277-280, 2007 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT'07), 2007
Usage of this product signifies your acceptance of the Terms of Use.


Click here to go to beta feedback form