Sciweavers

450 search results - page 25 / 90
» Adaptive Algorithms for Online Decision Problems
JMLR
2010
189 views
13 years 2 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICMCS
2006
IEEE
125 views, Multimedia
14 years 1 months ago
Online Doubletalk Detector Calibration for Acoustic Echo Cancellation in Videoconferencing Systems
This paper addresses the problem of doubletalk detector calibration for acoustic echo cancellers in hands-free environments such as videoconferencing. A statistical model of a rec...
James D. Gordy, Rafik A. Goubran
ICASSP
2009
IEEE
14 years 2 months ago
Data-driven online variational filtering in wireless sensor networks
In this paper, a data-driven extension of the variational algorithm is proposed. Based on a few selected sensors, target tracking is performed distributively without any informati...
Hichem Snoussi, Jean-Yves Tourneret, Petar M. Djur...
WINET
2002
103 views
13 years 7 months ago
LeZi-Update: An Information-Theoretic Framework for Personal Mobility Tracking in PCS Networks
The complexity of the mobility tracking problem in a cellular environment has been characterized under an information-theoretic framework. Shannon's entropy measure is identif...
Amiya Bhattacharya, Sajal K. Das
IPPS
2007
IEEE
14 years 2 months ago
Online Aggregation over Trees
Consider a distributed network with nodes arranged in a tree, and each node having a local value. We consider the problem of aggregating values (e.g., summing values) from all nod...
C. Greg Plaxton, Mitul Tiwari, Praveen Yalagandula