Search Sciweavers | Sciweavers

536 search results - page 40 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

click to vote

ICML
1999
IEEE

138views Machine Learning» more ICML 1999»

Using Reinforcement Learning to Spider the Web Efficiently

14 years 8 months ago

Download www.cs.iastate.edu

Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...

Jason Rennie, Andrew McCallum

claim paper

Read More »

click to vote

ICA
2007
Springer

182views Signal Processing» more ICA 2007»

Dictionary Learning for L1-Exact Sparse Coding

14 years 1 months ago

Download www.elec.qmul.ac.uk

We have derived a new algorithm for dictionary learning for sparse coding in the ℓ1 exact sparse framework. The algorithm does not rely on an approximation residual to operate, b...

Mark D. Plumbley

claim paper

Read More »

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

14 years 8 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

click to vote

PKDD
2009
Springer

184views Data Mining» more PKDD 2009»

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

14 years 7 days ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

claim paper

Read More »

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 9 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

« Prev « First page 40 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers