Search Sciweavers | Sciweavers

651 search results - page 41 / 131

» Algorithms for Inverse Reinforcement Learning

click to vote

ATAL
2008
Springer

160views Intelligent Agents» more ATAL 2008»

Sequential decision making in repeated coalition formation under uncertainty

13 years 11 months ago

Download www.aamas-conference.org

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 10 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

click to vote

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

14 years 10 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

click to vote

NIPS
2008

149views Information Technology» more NIPS 2008»

Optimization on a Budget: A Reinforcement Learning Approach

13 years 10 months ago

Download www.cs.arizona.edu

Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...

Paul Ruvolo, Ian R. Fasel, Javier R. Movellan

claim paper

Read More »

click to vote

ML
1998
ACM

101views Machine Learning» more ML 1998»

Elevator Group Control Using Multiple Reinforcement Learning Agents

13 years 8 months ago

Download www.clear.rice.edu

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithmshave appeared that approximatedynamic programming on an ...

Robert H. Crites, Andrew G. Barto

claim paper

Read More »

« Prev « First page 41 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers