Search Sciweavers | Sciweavers

337 search results - page 37 / 68

» Mean-Variance Optimization in Markov Decision Processes

GLOBECOM
2007
IEEE

Cognitive Medium Access: A Protocol for Enhancing Coexistence in WLAN Bands

14 years 2 months ago

Download acsp.ece.cornell.edu

In this paper we propose Cognitive Medium Access (CMA), a protocol aimed at improving coexistence with a set of independently evolving WLAN bands. A time-slotted physical layer...

Stefan Geirhofer, Lang Tong, Brian M. Sadler

AAAI
2006

Learning Basis Functions in Hybrid Domains

13 years 9 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

IJCAI
2003

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

13 years 9 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

AAAI
2007

Scaling Up: Solving POMDPs through Value Based Clustering

13 years 10 months ago

Download www.aaai.org

Partially Observable Markov Decision Processes (POMDPs) provide an appropriately rich model for agents operating under partial knowledge of the environment. Since finding an opti...

Yan Virin, Guy Shani, Solomon Eyal Shimony, Ronen ...

ALDT
2009
Springer

Finding Best k Policies

14 years 2 months ago

Download www.cs.uky.edu

An optimal probabilistic-planning algorithm solves a problem, usually modeled by a Markov decision process, by finding its optimal policy. In this paper, we study the k ...

Peng Dai, Judy Goldsmith

