Search Sciweavers | Sciweavers

4544 search results - page 3 / 909

» Reinforcement Learning with Time

176

Voted

AAAI
1997

107views Intelligent Agents» more AAAI 1997»

Reinforcement Learning with Time

15 years 8 months ago

Download www.aaai.org

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

claim paper

Read More »

199

Voted

ICMLA
2009

228views Machine Learning» more ICMLA 2009»

The Neuro Slot Car Racer: Reinforcement Learning in a Real World Setting

15 years 5 months ago

Download www.timkietzmann.de

This paper describes a novel real-world reinforcement learning application: The Neuro Slot Car Racer. In addition to presenting the system and first results based on Neural Fitted...

Tim C. Kietzmann, Martin Riedmiller

claim paper

Read More »

190

Voted

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

16 years 8 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

223

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

199

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 6 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

« Prev « First page 3 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers