Search Sciweavers | Sciweavers

340 search results - page 56 / 68

» Kernelized value function approximation for reinforcement le...

209

Voted

BDA
2007

171views Knowledge Management» more BDA 2007»

Hyperplane Queries in a Feature-Space M-tree for Speeding up Active Learning

15 years 3 months ago

Download perso.lcpc.fr

In content-based retrieval, relevance feedback (RF) is a noticeable method for reducing the “semantic gap” between the low-level features describing the content and the usually...

Michel Crucianu, Daniel Estevez, Vincent Oria, Jea...

posted by jptarel

Read More »

116

Voted

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 6 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

claim paper

Read More »

126

click to vote

AAAI
2011

144views Intelligent Agents» more AAAI 2011»

Differential Eligibility Vectors for Advantage Updating and Gradient Methods

14 years 2 months ago

Download gaips.inesc-id.pt

In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...

Francisco S. Melo

claim paper

Read More »

Voted

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

15 years 3 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

123

Voted

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 3 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 56 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers