Search Sciweavers | Sciweavers

473 search results - page 50 / 95

» Optimal policy switching algorithms for reinforcement learni...

NECO
2007

150 views

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 8 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

ATAL
2005
Springer

117 views · Intelligent Agents

Modeling task allocation using a decision theoretic model

14 years 2 months ago

Download dis.cs.umass.edu

Mediation is the process of decomposing a task into subtasks, finding agents suitable for these subtasks and negotiating with agents to obtain commitments to execute these subtas...

Sherief Abdallah, Victor R. Lesser

AAAI
2010

154 views · Intelligent Agents

Towards Multiagent Meta-level Control

13 years 10 months ago

Download coitweb.uncc.edu

Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

ECML
2007
Springer

192 views · Machine Learning

Policy Gradient Critics

14 years 3 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

ICML
2003
IEEE

104 views · Machine Learning

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

14 years 2 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong
