Search Sciweavers | Sciweavers

200 search results - page 24 / 40

» Point-Based Policy Iteration

180

IJCAI
2001

185 views, Artificial Intelligence

Symbolic Dynamic Programming for First-Order MDPs

15 years 8 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

162

AUTOMATICA
2005

108 views

Robust optimal control of regular languages

15 years 6 months ago

Download wimpy1.psu.edu

This paper presents an algorithm for robust optimal control of regular languages under specified uncertainty bounds on the event cost parameters of the language measure that has b...

Constantino M. Lagoa, Jinbo Fu, Asok Ray

145

ICONIP
2009

107 views, Information Technology

Tracking in Reinforcement Learning

15 years 4 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

130

ICRA
2010
IEEE

149 views, Robotics

A simple learning strategy for high-speed quadrocopter multi-flips

15 years 5 months ago

Download www.idsc.ethz.ch

We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-flips by combining iterative experiments with information from a first...

Sergei Lupashin, Angela Schöllig, Michael She...

145

NIPS
2004

102 views, Information Technology

Solitaire: Man Versus Machine

15 years 8 months ago

Download books.nips.cc

In this paper, we use the rollout method for policy improvement to analyze a version of Klondike solitaire. This version, sometimes called thoughtful solitaire, has all cards reve...

Xiang Yan, Persi Diaconis, Paat Rusmevichientong, ...
