Search Sciweavers | Sciweavers

91 search results - page 15 / 19

» Percentile Optimization for Markov Decision Processes with P...

click to vote

CORR
2008
Springer

189views Education» more CORR 2008»

Algorithms for Dynamic Spectrum Access with Learning for Cognitive Radio

13 years 7 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooperati...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

claim paper

Read More »

click to vote

CORR
2010
Springer

88views Education» more CORR 2010»

Multiple Timescale Dispatch and Scheduling for Stochastic Reliability in Smart Grids with Wind Generation Integration

13 years 7 months ago

Download informationnet.asu.edu

Integrating volatile renewable energy resources into the bulk power grid is challenging, due to the reliability requirement that at each instant the load and generation in the syst...

Miao He, Sugumar Murugesan, Junshan Zhang

claim paper

Read More »

click to vote

ICTAI
2009
IEEE

86views Artificial Intelligence» more ICTAI 2009»

TiMDPpoly: An Improved Method for Solving Time-Dependent MDPs

13 years 5 months ago

Download www.montefiore.ulg.ac.be

We introduce TiMDPpoly, an algorithm designed to solve planning problems with durative actions, under probabilistic uncertainty, in a non-stationary, continuous-time context. Miss...

Emmanuel Rachelson, Patrick Fabiani, Fréd&e...

claim paper

Read More »

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

14 years 1 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

click to vote

AIPS
2007

174views Artificial Intelligence» more AIPS 2007»

Learning to Plan Using Harmonic Analysis of Diffusion Models

13 years 10 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

claim paper

Read More »

« Prev « First page 15 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers