Search Sciweavers | Sciweavers

178 search results - page 24 / 36

» Probabilistic policy reuse in a reinforcement learning agent

click to vote

JUCS
2007

98views more JUCS 2007»

Focus of Attention in Reinforcement Learning

13 years 8 months ago

Download www.research.rutgers.edu

Abstract: Classiﬁcation-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

click to vote

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

14 years 9 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 8 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

14 years 2 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

click to vote

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

14 years 28 days ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 24 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers