Search Sciweavers | Sciweavers

181 search results - page 20 / 37

» State Space Reduction For Hierarchical Reinforcement Learnin...

141

click to vote

ECML
2004
Springer

100views Machine Learning» more ECML 2004»

Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework

15 years 9 months ago

Download bi.snu.ac.kr

Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...

Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...

claim paper

Read More »

142

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

15 years 10 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

171

click to vote

NN
2007
Springer

105views Neural Networks» more NN 2007»

Guiding exploration by pre-existing knowledge without modifying reward

15 years 3 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling

claim paper

Read More »

131

click to vote

IJCAI
2007

140views Artificial Intelligence» more IJCAI 2007»

Utile Distinctions for Relational Reinforcement Learning

15 years 5 months ago

Download www.ijcai.org

We introduce an approach to autonomously creating state space abstractions for an online reinforcement learning agent using a relational representation. Our approach uses a tree-b...

William Dabney, Amy McGovern

claim paper

Read More »

156

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

15 years 11 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

« Prev « First page 20 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers