Search Sciweavers | Sciweavers

164

Voted

IJCAI
2003

122views Artificial Intelligence» more IJCAI 2003»

Point-based value iteration: An anytime algorithm for POMDPs

15 years 8 months ago

This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution by selecting a small set of represen...

Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun

claim paper

Read More »

183

Voted

ICN
2007
Springer

97views Computer Networks» more ICN 2007»

Heuristic Approach of Optimal Code Allocation in High Speed Downlink Packet Access Networks

16 years 24 days ago

Download www.sce.carleton.ca

— In this paper, we use the Markov Decision Process (MDP) technique to ﬁnd the optimal code allocation policy in High-Speed Downlink Packet Access (HSDPA) networks. A discrete ...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...

claim paper

Read More »

164

click to vote

ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

16 years 7 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

claim paper

Read More »

190

Voted

CDC
2008
IEEE

206views Control Systems» more CDC 2008»

Approximate dynamic programming using support vector regression

16 years 1 months ago

Download web.mit.edu

— This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architect...

Brett Bethke, Jonathan P. How, Asuman E. Ozdaglar

claim paper

Read More »

207

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

14 years 2 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers