Sciweavers

3358 search results - page 615 / 672
» The Knowledge Grid Environment
Sort
View
TMC
2011
137views more  TMC 2011»
13 years 3 months ago
Cognitive Medium Access: Exploration, Exploitation, and Competition
— This paper establishes the equivalence between cognitive medium access and the competitive multi-armed bandit problem. First, the scenario in which a single cognitive user wish...
Lifeng Lai, Hesham El Gamal, Hai Jiang, H. Vincent...
TITS
2010
118views Education» more  TITS 2010»
13 years 3 months ago
Vision-Based Infotainment User Determination by Hand Recognition for Driver Assistance
We present a novel real-time computer-vision system that robustly discriminates which of the front-row seat occupants is accessing the infotainment controls. The knowledge of who i...
Shinko Y. Cheng, Mohan M. Trivedi
CORR
2011
Springer
194views Education» more  CORR 2011»
13 years 14 days ago
Accelerating Reinforcement Learning through Implicit Imitation
Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the kn...
Craig Boutilier, Bob Price
ICASSP
2011
IEEE
13 years 13 days ago
Logarithmic weak regret of non-Bayesian restless multi-armed bandit
Abstract—We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...
Haoyang Liu, Keqin Liu, Qing Zhao
ICASSP
2011
IEEE
13 years 13 days ago
Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation
In this study, we propose increasing discriminative power on the maximum a posteriori (MAP)-based mapping function estimation for acoustic model adaptation. Based on the effective...
Yu Tsao, Ryosuke Isotani, Hisashi Kawai, Satoshi N...