Search Sciweavers | Sciweavers

473 search results - page 34 / 95

» Optimal policy switching algorithms for reinforcement learni...

click to vote

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

14 years 4 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

14 years 8 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

click to vote

AAAI
2010

134views Intelligent Agents» more AAAI 2010»

Reinforcement Learning Via Practice and Critique Advice

13 years 9 months ago

Download web.engr.oregonstate.edu

We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...

Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...

claim paper

Read More »

click to vote

ICML
2003
IEEE

129views Machine Learning» more ICML 2003»

Relativized Options: Choosing the Right Transformation

14 years 8 months ago

Download www-anw.cs.umass.edu

Relativized options combine model minimization methods and a hierarchical reinforcement learning framework to derive compact reduced representations of a related family of tasks. ...

Balaraman Ravindran, Andrew G. Barto

claim paper

Read More »

click to vote

AIIA
2007
Springer

147views Artificial Intelligence» more AIIA 2007»

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

14 years 1 months ago

Download sequel.futurs.inria.fr

The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...

Andrea Bonarini, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

« Prev « First page 34 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers