Search Sciweavers | Sciweavers

87 search results - page 12 / 18

» A policy iteration algorithm for Markov decision processes s...

click to vote

TASE
2008
IEEE

100views Software Engineering» more TASE 2008»

Optimization of Joint Replacement Policies for Multipart Systems by a Rollout Framework

13 years 7 months ago

Download www.engr.uconn.edu

Maintaining an asset with life-limited parts, e.g., a jet engine or an electric generator, may be costly. Certain costs, e.g., setup cost, can be shared if some parts of the asset ...

Tao Sun, Qianchuan Zhao, Peter B. Luh, Robert N. T...

claim paper

Read More »

click to vote

ICRA
2003
IEEE

167views Robotics» more ICRA 2003»

Local exploration: online algorithms and a probabilistic framework

14 years 23 days ago

Download www.cis.upenn.edu

— Mapping an environment with an imaging sensor becomes very challenging if the environment to be mapped is unknown and has to be explored. Exploration involves the planning of v...

Volkan Isler, Sampath Kannan, Kostas Daniilidis

claim paper

Read More »

click to vote

IJCAI
2001

163views Artificial Intelligence» more IJCAI 2001»

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

13 years 8 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

13 years 11 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

14 years 8 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

« Prev « First page 12 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers