Search Sciweavers | Sciweavers

682 search results - page 54 / 137

» One-Counter Markov Decision Processes

ICML
2006
IEEE

Machine Learning

PAC model-free reinforcement learning

16 years 4 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with a finite state space (size S) and finite action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

ICML
2006
IEEE

Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 4 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

ICRA
2007
IEEE

Robotics

A formal framework for robot learning and control under model uncertainty

15 years 10 months ago

Download www.cs.mcgill.ca

While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

ICML
2006
IEEE

Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 9 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

ICVS
2001
Springer

Computer Vision

Adapting Object Recognition across Domains: A Demonstration

15 years 8 months ago

Download www.cs.colostate.edu

High-level vision systems use object, scene or domain specific knowledge to interpret images. Unfortunately, this knowledge has to be acquired for every domain. This makes it diffi...

Bruce A. Draper, Ulrike Ahlrichs, Dietrich Paulus
