Sciweavers

1233 search results - page 45 / 247
» Reinforcement learning
Sort
View
ICML
2009
IEEE
14 years 10 months ago
The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning
The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...
Carlos Diuk, Lihong Li, Bethany R. Leffler
COLT
2000
Springer
14 years 2 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
IJAIT
2008
146views more  IJAIT 2008»
13 years 10 months ago
Learning to Behave in Space: a Qualitative Spatial Representation for Robot Navigation with Reinforcement Learning
ion mechanism to create a representation of space consisting of the circular order of detected landmarks and the relative position of walls towards the agent's moving directio...
Lutz Frommberger
ATAL
2009
Springer
14 years 4 months ago
Learning with whom to communicate using relational reinforcement learning
Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...