Search Sciweavers | Sciweavers

1630 search results - page 81 / 326

» Coordinated Reinforcement Learning

148

Voted

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 5 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

137

Voted

AR
2004

84views more AR 2004»

Reinforcement learning of humanoid rhythmic walking parameters based on visual information

15 years 3 months ago

Download www.er.ams.eng.osaka-u.ac.jp

This paper presents a method for learning the parameters of rhythmic walking to generate purposive humanoid motions. The controller consists of the two layers: rhythmic walking is...

Masaki Ogino, Yutaka Katoh, Masahiro Aono, Minoru ...

claim paper

Read More »

225

Voted

ICSTM
2000

103views Management» more ICSTM 2000»

The worst failure: repeated failure to learn

15 years 5 months ago

Download www.aes.asn.au

Performance measurement systems based on the principle that "if you can't measure it, you can't manage it" reinforce a short-term culture by focussing on tangi...

Alan C. McLucas

claim paper

Read More »

187

click to vote

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

15 years 3 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

claim paper

Read More »

109

click to vote

KESAMSTA
2007
Springer

129views Intelligent Agents» more KESAMSTA 2007»

Reinforcement Learning on a Futures Market Simulator

15 years 10 months ago

Download www.jucs.org

: In recent years, market forecasting by machine learning methods has been ﬂourishing. Most existing works use a past market data set, because they assume that each trader’s in...

Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...

claim paper

Read More »

« Prev « First page 81 / 326 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers