Search Sciweavers | Sciweavers

164 search results - page 12 / 33

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

ICRA
2008
IEEE

173 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 1 month ago

Download www.cs.cmu.edu

We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

BMCV
2000
Springer

170 views, Computer Vision

Unsupervised Learning of Biologically Plausible Object Recognition Strategies

15 years 11 months ago

Download www.cs.colostate.edu

Recent psychological and neurological evidence suggests that biological object recognition is a process of matching sensed images to stored iconic memories. This paper presents a p...

Bruce A. Draper, Kyungim Baek

ILP
2005
Springer

149 views, Automated Reasoning

Guiding Inference Through Relational Reinforcement Learning

16 years 6 days ago

Download cll.stanford.edu

Reasoning plays a central role in intelligent systems that operate in complex situations that involve time constraints. In this paper, we present the Adaptive Logic Inter...

Nima Asgharbeygi, Negin Nejati, Pat Langley, Sachi...

CEC
2007
IEEE

126 views, Artificial Intelligence

Double-deck elevator systems using Genetic Network Programming with reinforcement learning

15 years 10 months ago

Download www.cs.york.ac.uk

In order to increase the transportation capability of elevator group systems in high-rise buildings without adding elevator installation space, double-deck elevator syst...

Jin Zhou, Lu Yu, Shingo Mabu, Kotaro Hirasawa, Jin...
