Search Sciweavers | Sciweavers

1684 search results - page 104 / 337

» The lexicographic decision function

119

Voted

EWRL
2008

144views Machine Learning» more EWRL 2008»

Regularized Fitted Q-Iteration: Application to Planning

15 years 5 months ago

Download eprints.pascal-network.org

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

136

Voted

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 5 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

Voted

NIPS
2004

128views Information Technology» more NIPS 2004»

A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees

15 years 5 months ago

Download books.nips.cc

We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of p...

Daniela Pucci de Farias, Benjamin Van Roy

claim paper

Read More »

129

Voted

UAI
2004

92views Artificial Intelligence» more UAI 2004»

Hybrid Influence Diagrams Using Mixtures of Truncated Exponentials

15 years 5 months ago

Download web.ku.edu

Mixtures of truncated exponentials (MTE) potentials are an alternative to discretization for representing continuous chance variables in influence diagrams. Also, MTE potentials c...

Barry R. Cobb, Prakash P. Shenoy

claim paper

Read More »

167

Voted

ICRA
2010
IEEE

101views Robotics» more ICRA 2010»

Multirobot coordination by auctioning POMDPs

15 years 2 months ago

Download users.isr.ist.utl.pt

— We consider the problem of task assignment and execution in multirobot systems, by proposing a procedure for bid estimation in auction protocols. Auctions are of interest to mu...

Matthijs T. J. Spaan, Nelson Gonçalves, Jo&...

claim paper

Read More »

« Prev « First page 104 / 337 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers