Search Sciweavers | Sciweavers

2665 search results - page 136 / 533

» Bundle Methods for Machine Learning

click to vote

ICML
2000
IEEE

155views Machine Learning» more ICML 2000»

Combining Reinforcement Learning with a Local Control Algorithm

14 years 10 months ago

Download www-anw.cs.umass.edu

We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...

Andrew G. Barto, Jette Randløv, Michael T. ...

claim paper

Read More »

click to vote

ICML
2006
IEEE

193views Machine Learning» more ICML 2006»

Maximum margin planning

14 years 10 months ago

Download www.cs.cmu.edu

Mobile robots often rely upon systems that render sensor data and perceptual features into costs that can be used in a planner. The behavior that a designer wishes the planner to ...

Nathan D. Ratliff, J. Andrew Bagnell, Martin Zinke...

claim paper

Read More »

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

14 years 3 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

click to vote

APIN
1999

107views more APIN 1999»

Massively Parallel Probabilistic Reasoning with Boltzmann Machines

13 years 9 months ago

Download cosco.hiit.fi

We present a method for mapping a given Bayesian network to a Boltzmann machine architecture, in the sense that the the updating process of the resulting Boltzmann machine model pr...

Petri Myllymäki

claim paper

Read More »

click to vote

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

Preference learning with Gaussian processes

14 years 10 months ago

Download www.gatsby.ucl.ac.uk

In this paper, we propose a probabilistic kernel approach to preference learning based on Gaussian processes. A new likelihood function is proposed to capture the preference relat...

Wei Chu, Zoubin Ghahramani

claim paper

Read More »

« Prev « First page 136 / 533 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers