Sciweavers

» An analysis of a Monte Carlo algorithm for estimating the pe...
NIPS
2001
13 years 9 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
ICIP
2005
IEEE
14 years 9 months ago
Tracking multiple cells by correspondence resolution in a sequential Bayesian framework
We propose a multi-target tracking (MTT) algorithm in a sequential Bayesian framework that computes cell velocities from video microscopy. Unlike the traditional tracking methods,...
Nilanjan Ray, Gang Dong, Scott T. Acton
TSP
2010
13 years 2 months ago
Efficient recursive estimators for a linear, time-varying Gaussian model with general constraints
The adaptive estimation of a time-varying parameter vector in a linear Gaussian model is considered where we a priori know that the parameter vector belongs to a known arbitrary s...
Stefan Uhlich, Bin Yang
CVPR
2005
IEEE
14 years 9 months ago
Learning to Estimate Human Pose with Data Driven Belief Propagation
We propose a statistical formulation for 2-D human pose estimation from single images. The human body configuration is modeled by a Markov network and the estimation problem is to...
Gang Hua, Ming-Hsuan Yang, Ying Wu
ICASSP
2011
IEEE
12 years 11 months ago
Joint data detection and dominant singular mode estimation in time varying reciprocal MIMO systems
This paper proposes an algorithm for joint data detection and tracking of the dominant singular mode of a time varying channel at the transmitter and receiver of a time division d...
Ranjitha Prasad, Bettagere Nagaraja Bharath, Chand...