Search Sciweavers | Sciweavers

332 search results - page 31 / 67

» Ranking policies in discrete Markov decision processes

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

14 years 8 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

click to vote

ICC
2007
IEEE

98views Communications» more ICC 2007»

Dynamic Lightpath Establishment for Service Differentiation Based on Optimal MDP Policy in All-Optical Networks with Wavelength

14 years 2 months ago

Download www.prism.uvsq.fr

— In this paper, we propose a dynamic lightpath establishment method for service differentiation in all-optical WDM networks with the capability of full-range wavelength conversi...

Takuji Tachibana, Shoji Kasahara, Kenji Sugimoto

claim paper

Read More »

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 9 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

14 years 8 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

click to vote

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

14 years 2 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

« Prev « First page 31 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers