Sciweavers

4544 search results - page 97 / 909
» Reinforcement Learning with Time
Sort
View
NIPS
1994
13 years 10 months ago
Reinforcement Learning with Soft State Aggregation
It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...
CSL
2010
Springer
13 years 9 months ago
Evaluation of a hierarchical reinforcement learning spoken dialogue system
We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment a...
Heriberto Cuayáhuitl, Steve Renals, Oliver ...
CORR
2010
Springer
105views Education» more  CORR 2010»
13 years 7 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
13 years 6 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
ICRA
2007
IEEE
147views Robotics» more  ICRA 2007»
14 years 3 months ago
Neural Reinforcement Learning Controllers for a Real Robot Application
— Accurate and fast control of wheel speeds in the presence of noise and nonlinearities is one of the crucial requirements for building fast mobile robots, as they are required i...
Roland Hafner, Martin Riedmiller