Search Sciweavers | Sciweavers

185 search results - page 8 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

UAI
2004

131 views, Artificial Intelligence

Dynamic Programming for Structured Continuous Markov Decision Problems

15 years 4 months ago

Download www.cs.bham.ac.uk

We describe an approach for exploiting structure in Markov Decision Processes with continuous state variables. At each step of the dynamic programming, the state space is dynamica...

Zhengzhu Feng, Richard Dearden, Nicolas Meuleau, R...

KDD
2010
ACM

282 views, Data Mining

Optimizing debt collections using constrained reinforcement learning

15 years 7 months ago

Download www.prem-melville.com

In this paper, we propose and develop a novel approach to the problem of optimally managing the tax, and more generally debt, collections processes at financial institutions. Our...

Naoki Abe, Prem Melville, Cezar Pendus, Chandan K....

ICML
2006
IEEE

143 views, Machine Learning

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 4 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|^3) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

CORR
2010
Springer

105 views, Education

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 2 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

ATAL
2007
Springer

112 views, Intelligent Agents

A globally optimal algorithm for TTD-MDPs

15 years 9 months ago

Download www.cc.gatech.edu

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs), a variant of MDPs in which the goal is to realize a specified distrib...

Sooraj Bhat, David L. Roberts, Mark J. Nelson, Cha...
