Search Sciweavers | Sciweavers

1176 search results - page 12 / 236

» Sparse reward processes

COLING
2010

Computational Linguistics

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

15 years 1 month ago

Download aclweb.org

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...

Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...

ICASSP
2011
IEEE

Signal Processing

Logarithmic weak regret of non-Bayesian restless multi-armed bandit

14 years 10 months ago

Download www.ece.ucdavis.edu

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...

Haoyang Liu, Keqin Liu, Qing Zhao

ICIP
2001
IEEE

Image Processing

A self-referencing level-set method for image reconstruction from sparse Fourier samples

16 years 8 months ago

Download bisp.kaist.ac.kr

Jong Chul Ye, Yoram Bresler, Pierre Moulin

ICA
2010
Springer

Signal Processing

SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms

15 years 6 months ago

Download www.elec.qmul.ac.uk

SMALLbox is a new foundational framework for processing signals, using adaptive sparse structured representations. The main aim of SMALLbox is to become a test ground for explorati...

Ivan Damnjanovic, Matthew E. P. Davies, Mark D. Pl...

EUROCAST
2007
Springer

Hardware

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 26 days ago

Download www.dia.fi.upm.es

Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...
