Sciweavers

129 search results - page 14 / 26
» Automatic Recovery Using Bounded Partially Observable Markov...
Sort
View
JAIR
2008
130views more  JAIR 2008»
15 years 3 months ago
Online Planning Algorithms for POMDPs
Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP i...
Stéphane Ross, Joelle Pineau, Sébast...
CORR
2008
Springer
189views Education» more  CORR 2008»
15 years 3 months ago
Algorithms for Dynamic Spectrum Access with Learning for Cognitive Radio
We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooperati...
Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli
AIMSA
2004
Springer
15 years 6 months ago
Towards Well-Defined Multi-agent Reinforcement Learning
Multi-agent reinforcement learning (MARL) is an emerging area of research. However, it lacks two important elements: a coherent view on MARL, and a well-defined problem objective. ...
Rinat Khoussainov
GLOBECOM
2009
IEEE
15 years 24 days ago
Dogfight in Spectrum: Jamming and Anti-Jamming in Multichannel Cognitive Radio Systems
Primary user emulation attack in multichannel cognitive radio systems is discussed. An attacker is assumed to be able to send primary-user-like signals during spectrum sensing peri...
Husheng Li, Zhu Han
NIPS
2001
15 years 4 months ago
Predictive Representations of State
We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...
Michael L. Littman, Richard S. Sutton, Satinder P....