Search Sciweavers | Sciweavers

473 search results - page 28 / 95

» Programmable Reinforcement Learning Agents

click to vote

ICML
2006
IEEE

153views Machine Learning» more ICML 2006»

Autonomous shaping: knowledge transfer in reinforcement learning

14 years 9 months ago

Download www-all.cs.umass.edu

We introduce the use of learned shaping rewards in reinforcement learning tasks, where an agent uses prior experience on a sequence of tasks to learn a portable predictor that est...

George Konidaris, Andrew G. Barto

claim paper

Read More »

click to vote

JMLR
2002

125views more JMLR 2002»

Lyapunov Design for Safe Reinforcement Learning

13 years 8 months ago

Download www-anw.cs.umass.edu

Lyapunov design methods are used widely in control engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system'...

Theodore J. Perkins, Andrew G. Barto

claim paper

Read More »

click to vote

AAAI
1993

107views Intelligent Agents» more AAAI 1993»

Complexity Analysis of Real-Time Reinforcement Learning

13 years 10 months ago

Download www.ri.cmu.edu

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous realtime versions of Q-learning and value-iteration, applied to the problem of...

Sven Koenig, Reid G. Simmons

claim paper

Read More »

click to vote

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

13 years 11 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

click to vote

AAAI
1998

122views Intelligent Agents» more AAAI 1998»

A Framework for Reinforcement Learning on Real Robots

13 years 10 months ago

Download www.cs.wustl.edu

Learning on real robots in an real, unaltered environment provides an extremely challenging problem. Many of the simplifying assumptions made in other areas of learning cannot be ...

William D. Smart, Leslie Pack Kaelbling

claim paper

Read More »

« Prev « First page 28 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers