Search Sciweavers | Sciweavers

312 search results - page 15 / 63

» Learning Partially Observable Deterministic Action Models

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

15 years 3 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 3 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

UAI
2008

224views Artificial Intelligence» more UAI 2008»

Sampling First Order Logical Particles

15 years 3 months ago

Download uai2008.cs.helsinki.fi

Approximate inference in dynamic systems is the problem of estimating the state of the system given a sequence of actions and partial observations. High precision estimation is fu...

Hannaneh Hajishirzi, Eyal Amir

COLT
2007
Springer

104views Machine Learning» more COLT 2007»

Observational Learning in Random Networks

15 years 8 months ago

Download www.as.inf.ethz.ch

In the standard model of observational learning, n agents sequentially decide between two alternatives a or b, one of which is objectively superior. Their choice is based on a stoc...

Julian Lorenz, Martin Marciniszyn, Angelika Steger

CVPR
2008
IEEE

304views Computer Vision» more CVPR 2008»

Context and observation driven latent variable model for human pose estimation

16 years 4 months ago

Download www.fxpal.com

Current approaches to pose estimation and tracking can be classified into two categories: generative and discriminative. While generative approaches can accurately determine human...

Abhinav Gupta, Trista Chen, Francine Chen, Don Kim...
