Search Sciweavers | Sciweavers

163 search results - page 33 / 33

» Policy Gradient Methods for Robotics

158

click to vote

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 11 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

179

click to vote

SSDBM
2000
IEEE

149views Database» more SSDBM 2000»

Coordinating Simultaneous Caching of File Bundles from Tertiary Storage

15 years 11 months ago

Download www.ocf.berkeley.edu

In a previous paper [Shoshani et al 99], we described a system called STACS (Storage Access Coordination System) for High Energy and Physics (HEP) experiments. These experiments g...

Arie Shoshani, Alex Sim, Luis M. Bernardo, Henrik ...

claim paper

Read More »

185

click to vote

ATAL
2010
Springer

129views Intelligent Agents» more ATAL 2010»

Learning multi-agent state space representations

15 years 7 months ago

Download como.vub.ac.be

This paper describes an algorithm, called CQ-learning, which learns to adapt the state representation for multi-agent systems in order to coordinate with other agents. We propose ...

Yann-Michaël De Hauwere, Peter Vrancx, Ann No...

claim paper

Read More »

« Prev « First page 33 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers