Search Sciweavers | Sciweavers

200 search results - page 10 / 40

» Point-Based Policy Iteration

182

Voted

ICRA
2009
IEEE

227views Robotics» more ICRA 2009»

Adaptive autonomous control using online value iteration with gaussian processes

16 years 1 months ago

Download www-personal.acfr.usyd.edu.au

— In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

claim paper

Read More »

158

Voted

ICC
2007
IEEE

124views Communications» more ICC 2007»

Optimal Scheduling Policy Determination for High Speed Downlink Packet Access

16 years 1 months ago

Download www.sce.carleton.ca

— In this paper, we present an analytic model and methodology to determine optimal scheduling policy that involves two dimension space allocation: time and code, in High Speed Do...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...

claim paper

Read More »

146

Voted

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

Learning all optimal policies with multiple criteria

16 years 7 months ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan

claim paper

Read More »

175

click to vote

ATAL
2010
Springer

164views Intelligent Agents» more ATAL 2010»

Point-based policy generation for decentralized POMDPs

15 years 7 months ago

Download anytime.cs.umass.edu

Memory-bounded techniques have shown great promise in solving complex multi-agent planning problems modeled as DEC-POMDPs. Much of the performance gains can be attributed to pruni...

Feng Wu, Shlomo Zilberstein, Xiaoping Chen

claim paper

Read More »

161

click to vote

DEDS
2010

97views more DEDS 2010»

On Regression-Based Stopping Times

15 years 6 months ago

Download www.stanford.edu

We study approaches that fit a linear combination of basis functions to the continuation value function of an optimal stopping problem and then employ a greedy policy based on the...

Benjamin Van Roy

claim paper

Read More »

« Prev « First page 10 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers