Search Sciweavers | Sciweavers

147 search results - page 27 / 30

» Rule value reinforcement learning for cognitive agents

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 8 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

click to vote

AAAI
2006

118views Intelligent Agents» more AAAI 2006»

Hard Constrained Semi-Markov Decision Processes

13 years 9 months ago

Download www.aaai.org

In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...

Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong

claim paper

Read More »

click to vote

ATAL
2004
Springer

97views Intelligent Agents» more ATAL 2004»

Unifying Temporal and Structural Credit Assignment Problems

14 years 1 months ago

Download ti.arc.nasa.gov

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...

Adrian K. Agogino, Kagan Tumer

claim paper

Read More »

click to vote

ISCAS
2002
IEEE

153views Hardware» more ISCAS 2002»

Biological learning modeled in an adaptive floating-gate system

14 years 17 days ago

Download users.ece.gatech.edu

We have implemented an aspect of learning and memory in the nervous system using analog electronics. Using a simple synaptic circuit we realize networks with Hebbian type adaptati...

Christal Gordon, Paul E. Hasler

claim paper

Read More »

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

13 years 9 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

« Prev « First page 27 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers