Search Sciweavers | Sciweavers

337 search results - page 62 / 68

» Mean-Variance Optimization in Markov Decision Processes

149

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

16 years 6 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

177

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 6 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

160

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

16 years 6 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

144

CDC
2008
IEEE

197views Control Systems» more CDC 2008»

Dynamic spectrum access policies for cognitive radio

15 years 12 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

141

ICRA
2008
IEEE

128views Robotics» more ICRA 2008»

A point-based POMDP planner for target tracking

15 years 12 months ago

Download www.comp.nus.edu.sg

Target tracking has two variants that are often studied independently with different approaches: target searching requires a robot to find a target initially not visible, and ...

David Hsu, Wee Sun Lee, Nan Rong
