Sciweavers

1630 search results - page 183 / 326
» Coordinated Reinforcement Learning
Sort
View
COMAD
2008
15 years 5 months ago
Personalized Web-page Rendering System
Personalized rendering of web pages gives the users greater control to view only what they prefer. The goal of this work is to provide a tool that will let users customize the con...
Swapna Raj Prabakara Raj, Balaraman Ravindran
ML
2002
ACM
100views Machine Learning» more  ML 2002»
15 years 3 months ago
Structure in the Space of Value Functions
Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...
David J. Foster, Peter Dayan
SMC
2007
IEEE
102views Control Systems» more  SMC 2007»
15 years 10 months ago
An improved immune Q-learning algorithm
—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...
Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...
IROS
2006
IEEE
113views Robotics» more  IROS 2006»
15 years 10 months ago
Policy Gradient Methods for Robotics
— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal
ATAL
2008
Springer
15 years 6 months ago
Expediting RL by using graphical structures
The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...
Peng Dai, Alexander L. Strehl, Judy Goldsmith