Search Sciweavers | Sciweavers

59 search results - page 4 / 12

» Reinforcement learning of a simple control task using the sp...

ICRA 2009, IEEE | 111 views | Robotics

Model-based and model-free reinforcement learning for visual servoing

14 years 2 months ago

Download webdocs.cs.ualberta.ca

To address the difficulty of designing a controller for complex visual-servoing tasks, two learning-based uncalibrated approaches are introduced. The first method starts by b...

Amir Massoud Farahmand, Azad Shademan, Martin J...

JMLR 2002 | 133 views

Learning Precise Timing with LSTM Recurrent Networks

13 years 7 months ago

Download jmlr.csail.mit.edu

The temporal distance between events conveys information essential for numerous sequential tasks such as motor control and rhythm detection. While Hidden Markov Models tend to ign...

Felix A. Gers, Nicol N. Schraudolph, Jürgen S...

IJCAI 2001 | 151 views | Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

FLAIRS 2006 | 109 views | Artificial Intelligence

Refining Human Behavior Models in a Context-based Architecture

13 years 9 months ago

Download www.aaai.org

This paper describes an investigation into the refinement of context-based human behavior models through the use of experiential learning. Specifically, a tactical agent was endow...

David Aihe, Avelino J. Gonzalez

NIPS 1998 | 137 views | Information Technology

Risk Sensitive Reinforcement Learning

13 years 9 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch
