Sciweavers

450 search results - page 25 / 90
» Adaptive Algorithms for Online Decision Problems
Sort
View
JMLR
2010
189views more  JMLR 2010»
14 years 10 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICMCS
2006
IEEE
125views Multimedia» more  ICMCS 2006»
15 years 10 months ago
Online Doubletalk Detector Calibration for Acoustic Echo Cancellation in Videoconferencing Systems
This paper addresses the problem of doubletalk detector calibration for acoustic echo cancellers in hands-free environments such as videoconferencing. A statistical model of a rec...
James D. Gordy, Rafik A. Goubran
158
Voted
ICASSP
2009
IEEE
15 years 10 months ago
Data-driven online variational filtering in wireless sensor networks
In this paper, a data-driven extension of the variational algorithm is proposed. Based on a few selected sensors, target tracking is performed distributively without any informati...
Hichem Snoussi, Jean-Yves Tourneret, Petar M. Djur...
WINET
2002
103views more  WINET 2002»
15 years 3 months ago
LeZi-Update: An Information-Theoretic Framework for Personal Mobility Tracking in PCS Networks
The complexity of the mobility tracking problem in a cellular environment has been characterized under an information-theoretic framework. Shannon's entropy measure is identif...
Amiya Bhattacharya, Sajal K. Das
IPPS
2007
IEEE
15 years 10 months ago
Online Aggregation over Trees
Consider a distributed network with nodes arranged in a tree, and each node having a local value. We consider the problem of aggregating values (e.g., summing values) from all nod...
C. Greg Plaxton, Mitul Tiwari, Praveen Yalagandula