Sciweavers

29 search results - page 4 / 6
» Action Refinement in Reinforcement Learning by Probability S...
Sort
View
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
13 years 11 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ATAL
2008
Springer
13 years 9 months ago
Non-linear dynamics in multiagent reinforcement learning algorithms
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...
Sherief Abdallah, Victor R. Lesser
ILP
2007
Springer
14 years 1 months ago
Building Relational World Models for Reinforcement Learning
Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...
Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...
BROADNETS
2004
IEEE
13 years 11 months ago
Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning
The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...
Fei Yu, Vincent W. S. Wong, Victor C. M. Leung
ICML
2001
IEEE
14 years 8 months ago
Expectation Maximization for Weakly Labeled Data
We call data weakly labeled if it has no exact label but rather a numerical indication of correctness of the label "guessed" by the learning algorithm - a situation comm...
Yuri A. Ivanov, Bruce Blumberg, Alex Pentland