Sciweavers

124 search results - page 17 / 25
» Congestion control as a stochastic control problem with acti...
CDC
2010
IEEE
13 years 2 months ago
Adaptive bases for Q-learning
We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor
AUGHUMAN
2011
13 years 2 months ago
"Vection field" for pedestrian traffic control
Today, in the general traffic field, visual signs and audio cues are used for pedestrian control. As pedestrians need to acquire and recognize them, time delay between cognition a...
Masahiro Furukawa, Hiromi Yoshikawa, Taku Hachisu,...
ATAL
2005
Springer
14 years 1 month ago
Multiagent coordination by Extended Markov Tracking
We present here Extended Markov Tracking (EMT), a computationally tractable method for the online estimation of Markovian system dynamics, along with experimental support for its ...
Zinovi Rabinovich, Jeffrey S. Rosenschein
AAAI
2000
13 years 8 months ago
Localizing Search in Reinforcement Learning
Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...
Gregory Z. Grudic, Lyle H. Ungar
ML
1998
ACM
13 years 7 months ago
Elevator Group Control Using Multiple Reinforcement Learning Agents
Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an ...
Robert H. Crites, Andrew G. Barto