Search Sciweavers | Sciweavers

267 search results - page 51 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

NIPS
2001

101 views, Information Technology

The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay

15 years 8 months ago

Download books.nips.cc

Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...

Michael Kositsky, Andrew G. Barto

KI
2002
Springer

108 views, Artificial Intelligence

Qualitative Velocity and Ball Interception

15 years 6 months ago

Download fstolzenburg.hs-harz.de

In many approaches for qualitative spatial reasoning, navigation of an agent in a more or less static environment is considered (e.g. in the double-cross calculus [12]). However, i...

Frieder Stolzenburg, Oliver Obst, Jan Murray

GLOBECOM
2009
IEEE

253 views, Communications

Cooperative Communications with Relay Selection for QoS Provisioning in Wireless Sensor Networks

15 years 4 months ago

Download mmlab.snu.ac.kr

Cooperative communications have been demonstrated to be effective in combating the multiple fading effects in wireless networks, and improving the network performance in ...

Xuedong Liang, Ilangko Balasingham, Victor C. M. L...

CIMCA
2008
IEEE

125 views, Intelligent Agents

Tree Exploration for Bayesian RL Exploration

16 years 1 month ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

UAI
2008

242 views, Artificial Intelligence

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 8 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...
