Sciweavers

262 search results - page 40 / 53
» Bounded-Parameter Partially Observable Markov Decision Proce...
Sort
View
NIPS
2008
14 years 17 days ago
Hierarchical Semi-Markov Conditional Random Fields for Recursive Sequential Data
Inspired by the hierarchical hidden Markov models (HHMM), we present the hierarchical semi-Markov conditional random field (HSCRF), a generalisation of embedded undirected Markov ...
Tran The Truyen, Dinh Q. Phung, Hung Hai Bui, Svet...
VTC
2008
IEEE
185views Communications» more  VTC 2008»
14 years 5 months ago
Opportunistic Spectrum Access for Energy-Constrained Cognitive Radios
This paper considers a scenario in which a secondary user makes opportunistic use of a channel allocated to some primary network. The primary network operates in a time-slotted ma...
Anh Tuan Hoang, Ying-Chang Liang, David Tung Chong...
AAAI
2006
14 years 17 days ago
On the Difficulty of Achieving Equilibrium in Interactive POMDPs
We analyze the asymptotic behavior of agents engaged in an infinite horizon partially observable stochastic game as formalized by the interactive POMDP framework. We show that whe...
Prashant Doshi, Piotr J. Gmytrasiewicz
ALT
2006
Springer
14 years 8 months ago
Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...
Daniil Ryabko, Marcus Hutter
ISIPTA
2005
IEEE
161views Mathematics» more  ISIPTA 2005»
14 years 4 months ago
Decision making under incomplete data using the imprecise Dirichlet model
The paper presents an efficient solution to decision problems where direct partial information on the distribution of the states of nature is available, either by observations of ...
Lev V. Utkin, Thomas Augustin