Search Sciweavers | Sciweavers

337 search results - page 59 / 68

» Mean-Variance Optimization in Markov Decision Processes

179

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 6 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

177

click to vote

AIPS
2007

174views Artificial Intelligence» more AIPS 2007»

Learning to Plan Using Harmonic Analysis of Diffusion Models

15 years 7 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

claim paper

Read More »

142

click to vote

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

PUMA: Planning Under Uncertainty with Macro-Actions

15 years 6 months ago

Download www.cs.berkeley.edu

Planning in large, partially observable domains is challenging, especially when a long-horizon lookahead is necessary to obtain a good policy. Traditional POMDP planners that plan...

Ruijie He, Emma Brunskill, Nicholas Roy

claim paper

Read More »

157

Voted

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

15 years 11 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

130

click to vote

WCNC
2008
IEEE

101views Computer Networks» more WCNC 2008»

A Maximum-Throughput Call Admission Control Policy for CDMA Beamforming Systems

15 years 11 months ago

Download post.queensu.ca

— A throughput-maximization call admission control (CAC) policy is proposed for CDMA beamforming systems in which the QoS requirements in both physical and network layers can be ...

Wei Sheng, Steven D. Blostein

claim paper

Read More »

« Prev « First page 59 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers