Search Sciweavers | Sciweavers

46 search results - page 6 / 10

» A Sparse Sampling Algorithm for Near-Optimal Planning in Lar...

130

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

16 years 4 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

108

Voted

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

15 years 5 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

133

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 5 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

125

click to vote

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

PUMA: Planning Under Uncertainty with Macro-Actions

15 years 5 months ago

Download www.cs.berkeley.edu

Planning in large, partially observable domains is challenging, especially when a long-horizon lookahead is necessary to obtain a good policy. Traditional POMDP planners that plan...

Ruijie He, Emma Brunskill, Nicholas Roy

claim paper

Read More »

202

click to vote

Publication

273views

Monte Carlo Value Iteration for Continuous-State POMDPs

14 years 11 months ago

Download www.comp.nus.edu.sg

Partially observable Markov decision processes (POMDPs) have been successfully applied to various robot motion planning tasks under uncertainty. However, most existing POMDP algo...

Haoyu Bai, David Hsu, Wee Sun Lee, and Vien A. Ngo

posted by bhy

Read More »

« Prev « First page 6 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers