Search Sciweavers | Sciweavers

262 search results - page 18 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

ECML
2007
Springer

Machine Learning

Safe Q-Learning on Complete History Spaces

16 years 1 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

ICASSP
2008
IEEE

Signal Processing

Link throughput of multi-channel opportunistic access with limited sensing

16 years 1 months ago

Download www.ece.ucdavis.edu

We aim to characterize the maximum link throughput of a multi-channel opportunistic communication system. The states of these channels evolve as independent and identically dist...

Keqin Liu, Qing Zhao

AAAI
2008

Intelligent Agents

A Variance Analysis for POMDP Policy Evaluation

15 years 9 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

ATAL
2003
Springer

Intelligent Agents

Transition-independent decentralized Markov decision processes

16 years 20 days ago

Download anytime.cs.umass.edu

There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of m...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

ANOR
2010

Inventory management with partially observed nonstationary demand

15 years 7 months ago

Download www.pstat.ucsb.edu

We consider a continuous-time model for inventory management with Markov-modulated non-stationary demands. We introduce active learning by assuming that the state of the ...

Erhan Bayraktar, Michael Ludkovski
