Search Sciweavers | Sciweavers

124 search results - page 17 / 25

» Congestion control as a stochastic control problem with acti...

CDC
2010
IEEE

160 views · Control Systems

Adaptive bases for Q-learning

13 years 2 months ago

Download webee.technion.ac.il

Abstract: We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

AUGHUMAN
2011

330 views · Augmented Reality

"Vection field" for pedestrian traffic control

13 years 2 months ago

Download kaji-lab.jp

Today, in the general traffic field, visual signs and audio cues are used for pedestrian control. As pedestrians need to acquire and recognize them, time delay between cognition a...

Masahiro Furukawa, Hiromi Yoshikawa, Taku Hachisu,...

ATAL
2005
Springer

125 views · Intelligent Agents

Multiagent coordination by Extended Markov Tracking

14 years 1 month ago

Download www.cs.huji.ac.il

We present here Extended Markov Tracking (EMT), a computationally tractable method for the online estimation of Markovian system dynamics, along with experimental support for its ...

Zinovi Rabinovich, Jeffrey S. Rosenschein

AAAI
2000

139 views · Intelligent Agents

Localizing Search in Reinforcement Learning

13 years 8 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

ML
1998
ACM

101 views · Machine Learning

Elevator Group Control Using Multiple Reinforcement Learning Agents

13 years 7 months ago

Download www.clear.rice.edu

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an ...

Robert H. Crites, Andrew G. Barto
