Search Sciweavers | Sciweavers

615 search results - page 35 / 123

» A social reinforcement learning agent

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

13 years 11 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

click to vote

CORR
2011
Springer

194views Education» more CORR 2011»

Accelerating Reinforcement Learning through Implicit Imitation

13 years 1 months ago

Download www.aaai.org

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the kn...

Craig Boutilier, Bob Price

claim paper

Read More »

click to vote

ICCBR
2009
Springer

134views Automated Reasoning» more ICCBR 2009»

Improving Reinforcement Learning by Using Case Based Heuristics

14 years 4 months ago

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and ...

Reinaldo A. C. Bianchi, Raquel Ros, Ramon Ló...

claim paper

Read More »

click to vote

FLAIRS
1998

130views Artificial Intelligence» more FLAIRS 1998»

Learning to Race: Experiments with a Simulated Race Car

13 years 11 months ago

Download www.aaai.org

Our focus is on designing adaptable agents for highly dynamic environments. Wehave implementeda reinforcement learning architecture as the reactive componentof a twolayer control ...

Larry D. Pyeatt, Adele E. Howe

claim paper

Read More »

click to vote

ICML
2004
IEEE

161views Machine Learning» more ICML 2004»

Using relative novelty to identify useful temporal abstractions in reinforcement learning

14 years 10 months ago

Download www.cs.umass.edu

lative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning ?Ozg?ur S?im?sek ozgur@cs.umass.edu Andrew G. Barto barto@cs.umass.edu Department of Computer Scie...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

« Prev « First page 35 / 123 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers