Search Sciweavers | Sciweavers

107 search results - page 5 / 22

» Approximate Linear Programming for Constrained Partially Obs...

122

click to vote

AAAI
2007

101views Intelligent Agents» more AAAI 2007»

Purely Epistemic Markov Decision Processes

15 years 5 months ago

Download www.aaai.org

Planning under uncertainty involves two distinct sources of uncertainty: uncertainty about the effects of actions and uncertainty about the current state of the world. The most wi...

Régis Sabbadin, Jérôme Lang, N...

claim paper

Read More »

134

click to vote

GLOBECOM
2007
IEEE

134views Communications» more GLOBECOM 2007»

Bursty Traffic in Energy-Constrained Opportunistic Spectrum Access

15 years 7 months ago

Download www.ece.ucdavis.edu

We design opportunistic spectrum access strategies for improving spectrum efficiency. In each slot, a secondary user chooses a subset of channels to sense and decides whether to ac...

Yunxia Chen, Qing Zhao, Ananthram Swami

claim paper

Read More »

107

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 7 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

166

Voted

AAAI
2011

136views Intelligent Agents» more AAAI 2011»

Linear Dynamic Programs for Resource Management

14 years 3 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

128

Voted

AAAI
2010

180views Intelligent Agents» more AAAI 2010»

Relational Partially Observable MDPs

15 years 4 months ago

Download www.cs.tufts.edu

Relational Markov Decision Processes (MDP) are a useraction for stochastic planning problems since one can develop abstract solutions for them that are independent of domain size ...

Chenggang Wang, Roni Khardon

claim paper

Read More »

« Prev « First page 5 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers