Search Sciweavers | Sciweavers

81 search results - page 16 / 17

» An extended policy gradient algorithm for robot task learnin...

AI
1998
Springer

141 views, Artificial Intelligence

Utility-Based On-Line Exploration for Repeated Navigation in an Embedded Graph

15 years 2 months ago

Download lingcog.iit.edu

In this paper, we address the tradeoff between exploration and exploitation for agents which need to learn more about the structure of their environment in order to perform more effe...

Shlomo Argamon-Engelson, Sarit Kraus, Sigalit Sina

ATAL
2010
Springer

129 views, Intelligent Agents

Learning multi-agent state space representations

15 years 4 months ago

Download como.vub.ac.be

This paper describes an algorithm, called CQ-learning, which learns to adapt the state representation for multi-agent systems in order to coordinate with other agents. We propose ...

Yann-Michaël De Hauwere, Peter Vrancx, Ann No...

AGENTS
1999
Springer

105 views, Security Privacy

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 7 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

ATAL
2010
Springer

123 views, Intelligent Agents

Linear options

15 years 4 months ago

Download www.eecs.umich.edu

Learning, planning, and representing knowledge in large state spaces at multiple levels of temporal abstraction are key, long-standing challenges for building flexible autonomous agents. ...

Jonathan Sorg, Satinder P. Singh

CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

15 years 3 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná
