Sciweavers

34 search results - page 6 / 7
» Towards Finite-Sample Convergence of Direct Reinforcement Le...
Sort
View
NIPS
2008
13 years 8 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
CVPR
2008
IEEE
14 years 9 months ago
Kernel-based learning of cast shadows from a physical model of light sources and surfaces for low-level segmentation
In background subtraction, cast shadows induce silhouette distortions and object fusions hindering performance of high level algorithms in scene monitoring. We introduce a nonpara...
André Zaccarin, Nicolas Martel-Brisson
ICRA
2010
IEEE
162views Robotics» more  ICRA 2010»
13 years 6 months ago
Adaptive multi-robot coordination: A game-theoretic perspective
Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...
Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus
CSB
2002
IEEE
109views Bioinformatics» more  CSB 2002»
14 years 12 days ago
Towards Automatic Clustering of Protein Sequences
Analyzing protein sequence data becomes increasingly important recently. Most previous work on this area has mainly focused on building classification models. In this paper, we i...
Jiong Yang, Wei Wang 0010
NIPS
2008
13 years 8 months ago
Goal-directed decision making in prefrontal cortex: a computational framework
Research in animal learning and behavioral neuroscience has distinguished between two forms of action control: a habit-based form, which relies on stored action values, and a goal...
Matthew Botvinick, James An