Search Sciweavers | Sciweavers

615 search results - page 53 / 123

» A social reinforcement learning agent

click to vote

ATAL
2007
Springer

122views Intelligent Agents» more ATAL 2007»

Reducing the complexity of multiagent reinforcement learning

14 years 4 months ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 11 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

click to vote

ECML
2004
Springer

139views Machine Learning» more ECML 2004»

Batch Reinforcement Learning with State Importance

14 years 3 months ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classiﬁer mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

click to vote

AAAI
1998

150views Intelligent Agents» more AAAI 1998»

Tree Based Discretization for Continuous State Space Reinforcement Learning

13 years 11 months ago

Download www.cs.cmu.edu

Reinforcement learning is an effective technique for learning action policies in discrete stochastic environments, but its efficiency can decay exponentially with the size of the ...

William T. B. Uther, Manuela M. Veloso

claim paper

Read More »

click to vote

ATAL
2005
Springer

130views Intelligent Agents» more ATAL 2005»

Behavior transfer for value-function-based reinforcement learning

14 years 3 months ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone

claim paper

Read More »

« Prev « First page 53 / 123 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers