Sciweavers

797 search results - page 49 / 160
» Timed Control with Partial Observability
Sort
View
SIAMCO
2010
114views more  SIAMCO 2010»
13 years 4 months ago
Discrete Carleman Estimates for Elliptic Operators in Arbitrary Dimension and Applications
In arbitrary dimension, we consider the semi-discrete elliptic operator -2 t + AM , where AM is a finite difference approximation of the operator - x((x) x). For this operator we d...
Franck Boyer, Florence Hubert, Jérôme...
JAIR
2006
160views more  JAIR 2006»
13 years 10 months ago
Anytime Point-Based Approximations for Large POMDPs
The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact s...
Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun
ATAL
2009
Springer
14 years 4 months ago
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
ICC
2008
IEEE
169views Communications» more  ICC 2008»
14 years 4 months ago
Optimality of Myopic Sensing in Multi-Channel Opportunistic Access
—We consider opportunistic communications over multiple channels where the state (“good” or “bad”) of each channel evolves as independent and identically distributed Mark...
Tara Javidi, Bhaskar Krishnamachari, Qing Zhao, Mi...
GLOBECOM
2010
IEEE
13 years 8 months ago
Maximize Secondary User Throughput via Optimal Sensing in Multi-Channel Cognitive Radio Networks
In a cognitive radio network, the full-spectrum is usually divided into multiple channels. However, due to the hardware and energy constraints, a cognitive user (also called second...
Shimin Gong, Ping Wang, Wei Liu, Wei Yuan