Search Sciweavers | Sciweavers

181 search results - page 29 / 37

» State Space Reduction For Hierarchical Reinforcement Learnin...

CDC
2010
IEEE

160 views · Control Systems

Adaptive bases for Q-learning

13 years 2 months ago

Download webee.technion.ac.il

Abstract: We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

IJRR
2011

159 views

Learning visual representations for perception-action systems

13 years 2 months ago

Download robot-learning.de

We discuss vision as a sensory modality for systems that effect actions in response to perceptions. While the internal representations informed by vision may be arbitrarily compl...

Justus H. Piater, Sébastien Jodogne, Renaud...

GLVLSI
2009
IEEE

122 views · VLSI

Enhancing SAT-based sequential depth computation by pruning search space

14 years 2 months ago

Download nthucad.cs.nthu.edu.tw

The sequential depth determines the completeness of bounded model checking in design verification. Recently, a SAT-based method is proposed to compute the sequential depth of a de...

Yung-Chih Chen, Chun-Yao Wang

ICML
1996
IEEE

162 views · Machine Learning

Learning Evaluation Functions for Large Acyclic Domains

14 years 8 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

NIPS
1997

121 views · Information Technology

Generalized Prioritized Sweeping

13 years 8 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr
