Search Sciweavers | Sciweavers

128 search results - page 4 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

193

Voted

COLT
2007
Springer

143views Machine Learning» more COLT 2007»

Bounded Parameter Markov Decision Processes with Average Reward Criterion

16 years 28 days ago

Download ttic.uchicago.edu

Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with uncertainty in the parameters of a Markov Decision Process (MDP). Unlike the case of an MDP, t...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

204

Voted

ICMLA
2010

207views Machine Learning» more ICMLA 2010»

Multi-Agent Inverse Reinforcement Learning

15 years 4 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

claim paper

Read More »

206

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

204

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 7 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

179

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

15 years 8 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

« Prev « First page 4 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers