Search Sciweavers | Sciweavers

75 search results - page 6 / 15

» Reinforcement Learning for MDPs with Constraints

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

14 years 2 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

ICML
2008
IEEE

123views Machine Learning» more ICML 2008»

An object-oriented representation for efficient reinforcement learning

14 years 8 months ago

Download paul.rutgers.edu

Rich representations in reinforcement learning have been studied for the purpose of enabling generalization and making learning feasible in large state spaces. We introduce Object...

Carlos Diuk, Andre Cohen, Michael L. Littman

claim paper

Read More »

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

13 years 11 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

13 years 9 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

ICML
2000
IEEE

192views Machine Learning» more ICML 2000»

Convergence Problems of General-Sum Multiagent Reinforcement Learning

14 years 8 months ago

Download www.cs.ualberta.ca

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

claim paper

Read More »

« Prev « First page 6 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers