Search Sciweavers | Sciweavers

1062 search results - page 60 / 213

» Sublinear Optimization for Machine Learning

ECML
2007
Springer

108 views, Machine Learning

Safe Q-Learning on Complete History Spaces

16 years 1 month ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

COLT
2007
Springer

143 views, Machine Learning

Bounded Parameter Markov Decision Processes with Average Reward Criterion

16 years 1 month ago

Download ttic.uchicago.edu

Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with uncertainty in the parameters of a Markov Decision Process (MDP). Unlike the case of an MDP, t...

Ambuj Tewari, Peter L. Bartlett

ICML
2004
IEEE

127 views, Machine Learning

A needle in a haystack: local one-class optimization

16 years 7 months ago

Download www.cis.upenn.edu

This paper addresses the problem of finding a small and coherent subset of points in a given data. This problem, sometimes referred to as one-class or set covering, requires to fi...

Koby Crammer, Gal Chechik

ICML
2006
IEEE

156 views, Machine Learning

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 7 months ago

Download animatlab.lip6.fr

Recent decision-theoretic planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

ICML
2009
IEEE

142 views, Machine Learning

Curriculum learning

16 years 7 months ago

Download snowbird.djvuzone.org

Humans and animals learn much better when the examples are not randomly presented but organized in a meaningful order which illustrates gradually more concepts, and gradually more ...

Jérôme Louradour, Jason Weston, Ronan...
