Search Sciweavers | Sciweavers

71 search results - page 5 / 15

» An Analysis of Direct Reinforcement Learning in Non-Markovia...

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

13 years 5 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

ECAL
2001
Springer

110views Artificial Intelligence» more ECAL 2001»

Evolution of Reinforcement Learning in Uncertain Environments: Emergence of Risk-Aversion and Matching

14 years 2 days ago

Download gandalf.psych.umn.edu

Reinforcement learning (RL) is a fundamental process by which organisms learn to achieve a goal from interactions with the environment. Using Artiﬁcial Life techniques we derive ...

Yael Niv, Daphna Joel, Isaac Meilijson, Eytan Rupp...

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

14 years 8 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

click to vote

SCIA
2005
Springer

211views Image Analysis» more SCIA 2005»

Perception-Action Based Object Detection from Local Descriptor Combination and Reinforcement Learning

14 years 1 months ago

Download www.mobvis.org

This work proposes to learn visual encodings of attention patterns that enables sequential attention for object detection in real world environments. The system embeds a saccadic d...

Lucas Paletta, Gerald Fritz, Christin Seifert

claim paper

Read More »

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

« Prev « First page 5 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers