Search Sciweavers | Sciweavers

495 search results - page 36 / 99

» Constructing States for Reinforcement Learning

141

click to vote

ECML
2004
Springer

100views Machine Learning» more ECML 2004»

Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework

15 years 9 months ago

Download bi.snu.ac.kr

Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...

Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...

claim paper

Read More »

120

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 6 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

127

click to vote

IJCAI
2007

143views Artificial Intelligence» more IJCAI 2007»

Direct Code Access in Self-Organizing Neural Networks for Reinforcement Learning

15 years 5 months ago

Download www.aaai.org

TD-FALCON is a self-organizing neural network that incorporates Temporal Difference (TD) methods for reinforcement learning. Despite the advantages of fast and stable learning, TD...

Ah-Hwee Tan

claim paper

Read More »

133

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 5 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

142

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

15 years 10 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

« Prev « First page 36 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers