Sciweavers

181 search results - page 20 / 37
» State Space Reduction For Hierarchical Reinforcement Learnin...
Sort
View
ECML
2004
Springer
14 years 26 days ago
Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework
Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...
Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...
ICANNGA
2007
Springer
105views Algorithms» more  ICANNGA 2007»
14 years 1 months ago
Reinforcement Learning in Fine Time Discretization
Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...
Pawel Wawrzynski
NN
2007
Springer
105views Neural Networks» more  NN 2007»
13 years 7 months ago
Guiding exploration by pre-existing knowledge without modifying reward
Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...
Kary Främling
IJCAI
2007
13 years 9 months ago
Utile Distinctions for Relational Reinforcement Learning
We introduce an approach to autonomously creating state space abstractions for an online reinforcement learning agent using a relational representation. Our approach uses a tree-b...
William Dabney, Amy McGovern
IROS
2009
IEEE
206views Robotics» more  IROS 2009»
14 years 2 months ago
Bayesian reinforcement learning in continuous POMDPs with gaussian processes
— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...
Patrick Dallaire, Camille Besse, Stéphane R...