Search Sciweavers | Sciweavers

361 search results - page 6 / 73

» Approximate counting by dynamic programming

click to vote

CORR
2010
Springer

119views Education» more CORR 2010»

Dynamic Policy Programming

13 years 7 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

claim paper

Read More »

click to vote

NIPS
2008

109views Information Technology» more NIPS 2008»

Biasing Approximate Dynamic Programming with a Lower Discount Factor

13 years 9 months ago

Download hal.inria.fr

Most algorithms for solving Markov decision processes rely on a discount factor, which ensures their convergence. It is generally assumed that using an artificially low discount f...

Marek Petrik, Bruno Scherrer

claim paper

Read More »

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 7 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

14 years 8 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

click to vote

HICSS
2009
IEEE

108views Biometrics» more HICSS 2009»

Approximate Dynamic Programming in Knowledge Discovery for Rapid Response

14 years 2 months ago

Download people.orie.cornell.edu

One knowledge discovery problem in the rapid response setting is the cost of learning which patterns are indicative of a threat. This typically involves a detailed follow-through,...

Peter Frazier, Warren B. Powell, Savas Dayanik, Pa...

claim paper

Read More »

« Prev « First page 6 / 73 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers