Search Sciweavers | Sciweavers

495 search results - page 46 / 99

» Constructing States for Reinforcement Learning

142

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

15 years 10 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

124

click to vote

ML
2002
ACM

114views Machine Learning» more ML 2002»

Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

15 years 3 months ago

Download www.cs.ou.edu

The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...

Amy McGovern, J. Eliot B. Moss, Andrew G. Barto

claim paper

Read More »

124

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 5 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

113

click to vote

KESAMSTA
2007
Springer

129views Intelligent Agents» more KESAMSTA 2007»

Reinforcement Learning on a Futures Market Simulator

15 years 10 months ago

Download www.jucs.org

: In recent years, market forecasting by machine learning methods has been ﬂourishing. Most existing works use a past market data set, because they assume that each trader’s in...

Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...

claim paper

Read More »

151

click to vote

ALT
2006
Springer

146views Machine Learning» more ALT 2006»

Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

16 years 1 months ago

Download www.logos.t.u-tokyo.ac.jp

Abstract. Recently, some non-regular subclasses of context-free grammars have been found to be eﬃciently learnable from positive data. In order to use these eﬃcient algorithms ...

Takeshi Shibata, Ryo Yoshinaka, Takashi Chikayama

claim paper

Read More »

« Prev « First page 46 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers