Search Sciweavers | Sciweavers

683 search results - page 121 / 137

» Coarticulation in Markov Decision Processes

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

13 years 9 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

click to vote

NIPS
2001

158views Information Technology» more NIPS 2001»

Multiagent Planning with Factored MDPs

13 years 9 months ago

Download books.nips.cc

We present a principled and efficient planning algorithm for cooperative multiagent dynamic systems. A striking feature of our method is that the coordination and communication be...

Carlos Guestrin, Daphne Koller, Ronald Parr

claim paper

Read More »

click to vote

IJCAI
2003

137views Artificial Intelligence» more IJCAI 2003»

Approximating Optimal Policies for Agents with Limited Execution Resources

13 years 9 months ago

Download ai.stanford.edu

An agent with limited consumable execution resources needs policies that attempt to achieve good performance while respecting these limitations. Otherwise, an agent (such as a pla...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

click to vote

IJCAI
2003

111views Artificial Intelligence» more IJCAI 2003»

Generalizing Plans to New Environments in Relational MDPs

13 years 9 months ago

Download select.cs.cmu.edu

A longstanding goal in planning research is the ability to generalize plans developed for some set of environments to a new but similar environment, with minimal or no replanning....

Carlos Guestrin, Daphne Koller, Chris Gearhart, Ne...

claim paper

Read More »

« Prev « First page 121 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers