Sciweavers

1176 search results - page 12 / 236
» Sparse reward processes
COLING
2010
14 years 10 months ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...
ICASSP
2011
IEEE
14 years 7 months ago
Logarithmic weak regret of non-Bayesian restless multi-armed bandit
We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...
Haoyang Liu, Keqin Liu, Qing Zhao
ICIP
2001
IEEE
16 years 5 months ago
A self-referencing level-set method for image reconstruction from sparse Fourier samples
Jong Chul Ye, Yoram Bresler, Pierre Moulin
ICA
2010
Springer
15 years 3 months ago
SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms
SMALLbox is a new foundational framework for processing signals, using adaptive sparse structured representations. The main aim of SMALLbox is to become a test ground for explorati...
Ivan Damnjanovic, Matthew E. P. Davies, Mark D. Pl...
EUROCAST
2007
Springer
15 years 9 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...