Sciweavers

4544 search results - page 96 / 909
» Reinforcement Learning with Time
Sort
View
AR
2004
84views more  AR 2004»
13 years 8 months ago
Reinforcement learning of humanoid rhythmic walking parameters based on visual information
This paper presents a method for learning the parameters of rhythmic walking to generate purposive humanoid motions. The controller consists of the two layers: rhythmic walking is...
Masaki Ogino, Yutaka Katoh, Masahiro Aono, Minoru ...
ICSTM
2000
103views Management» more  ICSTM 2000»
13 years 10 months ago
The worst failure: repeated failure to learn
Performance measurement systems based on the principle that "if you can't measure it, you can't manage it" reinforce a short-term culture by focussing on tangi...
Alan C. McLucas
KESAMSTA
2007
Springer
14 years 3 months ago
Reinforcement Learning on a Futures Market Simulator
: In recent years, market forecasting by machine learning methods has been flourishing. Most existing works use a past market data set, because they assume that each trader’s in...
Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...
VLSID
2005
IEEE
105views VLSI» more  VLSID 2005»
14 years 2 months ago
Placement and Routing for 3D-FPGAs Using Reinforcement Learning and Support Vector Machines
The primary advantage of using 3D-FPGA over 2D-FPGA is that the vertical stacking of active layers reduce the Manhattan distance between the components in 3D-FPGA than when placed...
R. Manimegalai, E. Siva Soumya, V. Muralidharan, B...
PRICAI
1999
Springer
14 years 1 months ago
Rationality of Reward Sharing in Multi-agent Reinforcement Learning
Abstract. In multi-agent reinforcement learning systems, it is important to share a reward among all agents. We focus on the Rationality Theorem of Profit Sharing [5] and analyze ...
Kazuteru Miyazaki, Shigenobu Kobayashi