Sciweavers

1233 search results - page 40 / 247
» Reinforcement Learning in MirrorBot
Sort
View
EWRL
2008
14 years 22 days ago
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case
We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...
Kirill Dyagilev, Shie Mannor, Nahum Shimkin
AR
2004
84views more  AR 2004»
13 years 10 months ago
Reinforcement learning of humanoid rhythmic walking parameters based on visual information
This paper presents a method for learning the parameters of rhythmic walking to generate purposive humanoid motions. The controller consists of the two layers: rhythmic walking is...
Masaki Ogino, Yutaka Katoh, Masahiro Aono, Minoru ...
ICSTM
2000
103views Management» more  ICSTM 2000»
14 years 9 days ago
The worst failure: repeated failure to learn
Performance measurement systems based on the principle that "if you can't measure it, you can't manage it" reinforce a short-term culture by focussing on tangi...
Alan C. McLucas
AI
1998
Springer
13 years 10 months ago
Model-Based Average Reward Reinforcement Learning
Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...
Prasad Tadepalli, DoKyeong Ok
PRICAI
1999
Springer
14 years 3 months ago
Rationality of Reward Sharing in Multi-agent Reinforcement Learning
Abstract. In multi-agent reinforcement learning systems, it is important to share a reward among all agents. We focus on the Rationality Theorem of Profit Sharing [5] and analyze ...
Kazuteru Miyazaki, Shigenobu Kobayashi