Search Sciweavers | Sciweavers

995 search results - page 76 / 199

» Learning Useful Horn Approximations

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

14 years 10 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

click to vote

IJCNN
2000
IEEE

167views Neural Networks» more IJCNN 2000»

Metrics that Learn Relevance

14 years 2 months ago

Download lib.tkk.fi

We introduce an algorithm for learning a local metric to a continuous input space that measures distances in terms of relevance to the processing task. The relevance is deﬁned a...

Samuel Kaski, Janne Sinkkonen

claim paper

Read More »

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

14 years 8 days ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

click to vote

NIPS
1994

167views Information Technology» more NIPS 1994»

Active Learning with Statistical Models

13 years 11 months ago

Download wexler.free.fr

For many types of machine learning algorithms, one can compute the statistically optimal" way to select training data. In this paper, we review how optimal data selection tec...

David A. Cohn, Zoubin Ghahramani, Michael I. Jorda...

claim paper

Read More »

click to vote

ICRA
2009
IEEE

188views Robotics» more ICRA 2009»

Onboard contextual classification of 3-D point clouds with learned high-order Markov Random Fields

13 years 7 months ago

Download www.ri.cmu.edu

Contextual reasoning through graphical models such as Markov Random Fields often show superior performance against local classifiers in many domains. Unfortunately, this performanc...

Daniel Munoz, Nicolas Vandapel, Martial Hebert

claim paper

Read More »

« Prev « First page 76 / 199 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers