Sciweavers

337 search results - page 15 / 68
» Mean-Variance Optimization in Markov Decision Processes
ICML
2006
IEEE
14 years 8 months ago
Fast direct policy evaluation using multiscale analysis of Markov diffusion processes
Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|^3) to directly solve the Bellman system of |S...
Mauro Maggioni, Sridhar Mahadevan
CORR
2010
Springer
13 years 6 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
JAIR
2006
13 years 7 months ago
Solving Factored MDPs with Hybrid State and Action Variables
Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automa...
Branislav Kveton, Milos Hauskrecht, Carlos Guestri...
ISSS
1999
IEEE
13 years 12 months ago
Event-Driven Power Management of Portable Systems
The policy optimization problem for dynamic power management has received considerable attention in the recent past. We formulate policy optimization as a constrained optimization...
Tajana Simunic, Giovanni De Micheli, Luca Benini
ICTAI
2007
IEEE
14 years 2 months ago
Multi-criteria Decision Making for Local Coordination in Multi-agent Systems
Unlike mono-agent systems, multi-agent planning addresses the problem of resolving conflicts between individual and group interests. In this paper, we are using a Decentralized Ve...
Matthieu Boussard, Maroua Bouzid, Abdel-Illah Moua...