Search Sciweavers | Sciweavers

17 search results - page 3 / 4

ICRA
2008
IEEE

An approximate algorithm for solving oracular POMDPs

Abstract— We propose a new approximate algorithm, LAJIV (Lookahead J-MDP Information Value), to solve Oracular Partially Observable Markov Decision Problems (OPOMDPs), a special ...

Nicholas Armstrong-Crews, Manuela M. Veloso

ESOP
2007
Springer

Small Witnesses for Abstract Interpretation-Based Proofs

Small Witnesses for Abstract Interpretation-based Proofs. Frédéric Besson, Thomas Jensen, and Tiphaine Turpin, IRISA/{Inria, CNRS, Université de Rennes 1}, Campus de Beaulieu, F-35042 R...

Frédéric Besson, Thomas P. Jensen, T...

QUESTA
2010

Admission control for a multi-server queue with abandonment

In an M/M/N+M queue, when there are many customers waiting, it may be preferable to reject a new arrival rather than risk that arrival later abandoning without receiving service. O...

Yasar Levent Koçaga, Amy R. Ward

UAI
2004

Discretized Approximations for POMDP with Average Cost

In this paper, we propose a new lower approximation scheme for POMDPs with discounted and average cost criteria. The approximating functions are determined by their values at a fi...

Huizhen Yu, Dimitri P. Bertsekas

ICML
2005
ACM

Proto-value functions: developmental reinforcement learning

This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...

Sridhar Mahadevan
