Sciweavers

473 search results - page 60 / 95
» Optimal policy switching algorithms for reinforcement learni...
Sort
View
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
13 years 7 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
CEC
2011
IEEE
12 years 9 months ago
On universal search strategies for multi-criteria optimization using weighted sums
—We develop a stochastic local search algorithm for finding Pareto points for multi-criteria optimization problems. The algorithm alternates between different single-criterium o...
Julien Legriel, Scott Cotton, Oded Maler
GECCO
2005
Springer
111views Optimization» more  GECCO 2005»
14 years 2 months ago
XCS with eligibility traces
The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...
Jan Drugowitsch, Alwyn Barry
IJCNN
2006
IEEE
14 years 3 months ago
Learning a Rendezvous Task with Dynamic Joint Action Perception
Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...
Nancy Fulda, Dan Ventura
ICML
1994
IEEE
14 years 16 days ago
Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...