Sciweavers

499 search results - page 43 / 100
» Model Minimization in Markov Decision Processes
CORR
2006
Springer
A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD
This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...
Manuel Loth, Philippe Preux
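The entry above concerns TD(λ)-style policy evaluation with linear function approximation. As a minimal illustrative sketch (not the paper's unified formulation), here is tabular-simulation TD(λ) with accumulating eligibility traces on a hypothetical random Markov chain; the features, transitions, and rewards are invented for the example:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical tiny Markov chain: 5 states, 3-dimensional features.
n_states, n_features = 5, 3
phi = rng.standard_normal((n_states, n_features))   # feature matrix
P = np.full((n_states, n_states), 1.0 / n_states)   # uniform transitions
r = rng.standard_normal(n_states)                   # expected rewards

def td_lambda(phi, P, r, gamma=0.9, lam=0.8, alpha=0.05, steps=10_000):
    """TD(lambda) policy evaluation with linear function approximation
    and accumulating eligibility traces."""
    w = np.zeros(phi.shape[1])   # weight vector, V(s) ~ phi[s] @ w
    z = np.zeros_like(w)         # eligibility trace
    s = 0
    for _ in range(steps):
        s_next = rng.choice(len(r), p=P[s])
        delta = r[s] + gamma * phi[s_next] @ w - phi[s] @ w  # TD error
        z = gamma * lam * z + phi[s]                         # trace decay + feature
        w = w + alpha * delta * z                            # gradient-style update
        s = s_next
    return w

w = td_lambda(phi, P, r)
print(w.shape)
```

Setting `lam=0` recovers plain TD(0); `lam=1` approaches a Monte-Carlo-style update.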
WEBI
2010
Springer
Impacts of Analysts' Cognitive Styles on the Analytic Process
A user's cognitive style has been found to affect how they search for information, how they analyze it, and how they make decisions in an analytical process. In ...
Eugene Santos Jr., Hien Nguyen, Fei Yu, Deqing Li,...
ICCV
2011
IEEE
Perturb-and-MAP Random Fields: Using Discrete Optimization to Learn and Sample from Energy Models
We propose a novel way to induce a random field from an energy function on discrete labels. It amounts to locally injecting noise to the energy potentials, followed by finding t...
George Papandreou, Alan L. Yuille
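The core idea in the entry above is to sample from an energy model by injecting noise into the potentials and then solving a deterministic MAP (minimum-energy) problem. A minimal sketch of that idea, assuming independent nodes with invented unary potentials (in that degenerate case, Gumbel perturbation plus per-node argmin is exact Gibbs sampling; the paper's contribution concerns general random fields with interactions):

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical unary energy potentials: 4 nodes, 3 candidate labels each.
unary = rng.standard_normal((4, 3))

def perturb_and_map_sample(unary, rng):
    """Draw one labeling: add i.i.d. Gumbel noise to each potential,
    then take the MAP (minimum-energy) assignment per node."""
    gumbel = rng.gumbel(size=unary.shape)   # one noise draw per potential
    perturbed = unary - gumbel              # locally inject noise into energies
    return perturbed.argmin(axis=1)         # exact MAP for independent nodes

labels = perturb_and_map_sample(unary, rng)
print(labels.shape)
```

Repeated calls yield different labelings whose frequencies follow the Gibbs distribution exp(-E) over the unary energies.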
ICIP
2003
IEEE
A probabilistic framework for image segmentation
A new probabilistic image segmentation model based on hypothesis testing and Gibbs Random Fields is introduced. First, a probabilistic difference measure derived from a set of hyp...
Slawo Wesolkowski, Paul W. Fieguth
PRICAI
2000
Springer
Generating Hierarchical Structure in Reinforcement Learning from State Variables
This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...
Bernhard Hengst
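The entry above describes decomposing an MDP into a hierarchy of smaller MDPs keyed on state variables. As a rough illustrative sketch of the decomposition idea only (not the CQ algorithm itself): partition a factored state `(region, pos)` by the `region` variable and solve each region as its own small MDP with standard value iteration. All transition and reward numbers below are invented:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical factored MDP: state = (region, pos), 2 regions x 4 positions,
# 2 actions. Each region is treated as an independent sub-MDP.
regions, positions, n_actions = 2, 4, 2

def value_iteration(P, R, gamma=0.9, iters=200):
    """Tabular value iteration on P[a][s, s'] and R[s, a]."""
    V = np.zeros(R.shape[0])
    for _ in range(iters):
        Q = np.stack([R[:, a] + gamma * P[a] @ V
                      for a in range(P.shape[0])], axis=1)
        V = Q.max(axis=1)           # Bellman optimality backup
    return V

sub_values = {}
for region in range(regions):
    # Invented dynamics and rewards for this region's sub-MDP.
    P = rng.dirichlet(np.ones(positions), size=(n_actions, positions))
    R = rng.standard_normal((positions, n_actions))
    sub_values[region] = value_iteration(P, R)

print({k: v.shape for k, v in sub_values.items()})
```

Each sub-MDP is solved over `positions` states instead of the full `regions * positions` product, which is the payoff of the decomposition when regions interact only weakly.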