Search Sciweavers | Sciweavers

92 search results - page 4 / 19

» A General Convergence Method for Reinforcement Learning in t...

click to vote

ICAI
2004

116views Artificial Intelligence» more ICAI 2004»

Action Inhibition

13 years 8 months ago

Download mysite.verizon.net

An explicit exploration strategy is necessary in reinforcement learning (RL) to balance the need to reduce the uncertainty associated with the expected outcome of an action and the...

Myriam Abramson

claim paper

Read More »

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

13 years 6 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

click to vote

ECML
2004
Springer

139views Machine Learning» more ECML 2004»

Batch Reinforcement Learning with State Importance

14 years 3 days ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classiﬁer mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

click to vote

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

14 years 4 days ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

click to vote

ICCBR
2005
Springer

210views Automated Reasoning» more ICCBR 2005»

CBR for State Value Function Approximation in Reinforcement Learning

14 years 7 days ago

Download ml.informatik.uni-freiburg.de

CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

« Prev « First page 4 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers