Sciweavers

495 search results - page 46 / 99
» Constructing States for Reinforcement Learning
Sort
View
ICRA
2008
IEEE
173views Robotics» more  ICRA 2008»
15 years 10 months ago
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation
— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
ML
2002
ACM
114views Machine Learning» more  ML 2002»
15 years 3 months ago
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts
The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...
Amy McGovern, J. Eliot B. Moss, Andrew G. Barto
ICML
2006
IEEE
16 years 5 months ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto
KESAMSTA
2007
Springer
15 years 10 months ago
Reinforcement Learning on a Futures Market Simulator
: In recent years, market forecasting by machine learning methods has been flourishing. Most existing works use a past market data set, because they assume that each trader’s in...
Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...
ALT
2006
Springer
16 years 1 months ago
Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning
Abstract. Recently, some non-regular subclasses of context-free grammars have been found to be efficiently learnable from positive data. In order to use these efficient algorithms ...
Takeshi Shibata, Ryo Yoshinaka, Takashi Chikayama