Search Sciweavers | Sciweavers

29 search results - page 3 / 6

» Automatic basis function construction for approximate dynami...

click to vote

NIPS
2007

80views Information Technology» more NIPS 2007»

Stable Dual Dynamic Programming

13 years 9 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

Voted

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

13 years 7 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

claim paper

Read More »

click to vote

AAAI
2006

116views Intelligent Agents» more AAAI 2006»

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping

13 years 9 months ago

Download www.cs.utexas.edu

Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a ps...

Yaxin Liu, Peter Stone

claim paper

Read More »

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 7 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

click to vote

CORR
2010
Springer

119views Education» more CORR 2010»

Dynamic Policy Programming

13 years 7 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

claim paper

Read More »

« Prev « First page 3 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers