Search Sciweavers | Sciweavers

181 search results - page 7 / 37

» On Policy Learning in Restricted Policy Spaces

click to vote

ATAL
2004
Springer

105views Intelligent Agents» more ATAL 2004»

Best-Response Multiagent Learning in Non-Stationary Environments

14 years 1 months ago

Download www.odu.edu

This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...

Michael Weinberg, Jeffrey S. Rosenschein

claim paper

Read More »

click to vote

ICML
2003
IEEE

165views Machine Learning» more ICML 2003»

The Cross Entropy Method for Fast Policy Search

14 years 8 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

claim paper

Read More »

click to vote

ICCCI
2011
Springer

223views Intelligent Agents» more ICCCI 2011»

Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors

12 years 7 months ago

Download florinleon.byethost24.com

Multiagent reinforcement learning problems are especially difficult because of their dynamism and the size of joint state space. In this paper a new benchmark problem is proposed, ...

Florin Leon

claim paper

Read More »

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

14 years 1 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

13 years 9 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

« Prev « First page 7 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers