Search Sciweavers | Sciweavers

48 search results - page 2 / 10

» An Analysis of Case-Based Value Function Approximation by Ap...

click to vote

DAGSTUHL
2007

153views Software Engineering» more DAGSTUHL 2007»

A Deeper Investigation of PageRank as a Function of the Damping Factor

13 years 8 months ago

Download drops.dagstuhl.de

PageRank is deﬁned as the stationary state of a Markov chain. The chain is obtained by perturbing the transition matrix induced by a web graph with a damping factor α that spre...

Paolo Boldi, Massimo Santini, Sebastiano Vigna

claim paper

Read More »

click to vote

WWW
2005
ACM

146views Internet Technology» more WWW 2005»

PageRank as a function of the damping factor

14 years 7 months ago

Download www2005.org

PageRank is defined as the stationary state of a Markov chain. The chain is obtained by perturbing the transition matrix induced by a web graph with a damping factor that spreads...

Paolo Boldi, Massimo Santini, Sebastiano Vigna

claim paper

Read More »

click to vote

AIPS
2006

211views Artificial Intelligence» more AIPS 2006»

Solving Factored MDPs with Exponential-Family Transition Models

13 years 8 months ago

Download www.cs.pitt.edu

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

13 years 11 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

click to vote

ICML
2005
IEEE

135views Machine Learning» more ICML 2005»

Finite time bounds for sampling based fitted value iteration

14 years 7 months ago

Download www.machinelearning.org

In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...

Csaba Szepesvári, Rémi Munos

claim paper

Read More »

« Prev « First page 2 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers