Search Sciweavers | Sciweavers

51 search results - page 4 / 11

» Characterizing reinforcement learning methods through parame...

ICRA
2009
IEEE

143 views, Robotics

Least absolute policy iteration for robust value function approximation

15 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract: Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...


ECML
2004
Springer

100 views, Machine Learning

Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework

15 years 7 months ago

Download bi.snu.ac.kr

Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...

Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...


ECML
2004
Springer

139 views, Machine Learning

Batch Reinforcement Learning with State Importance

15 years 7 months ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner


ATAL
2009
Springer

167 views, Intelligent Agents

Solving multiagent assignment Markov decision processes

15 years 9 months ago

Download www.aamas-conference.org

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called “Assignmen...

Scott Proper, Prasad Tadepalli


AR
2007

105 views

Reinforcement learning of a continuous motor sequence with hidden states

15 years 2 months ago

Download www.bdc.brain.riken.go.jp

Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...

Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...
