Search Sciweavers | Sciweavers

34 search results - page 6 / 7

» Towards Finite-Sample Convergence of Direct Reinforcement Le...

NIPS
2008

110 views, Information Technology

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 9 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

CVPR
2008
IEEE

213 views, Computer Vision

Kernel-based learning of cast shadows from a physical model of light sources and surfaces for low-level segmentation

16 years 9 months ago

Download vision.gel.ulaval.ca

In background subtraction, cast shadows induce silhouette distortions and object fusions hindering performance of high level algorithms in scene monitoring. We introduce a nonpara...

André Zaccarin, Nicolas Martel-Brisson

ICRA
2010
IEEE

162 views, Robotics

Adaptive multi-robot coordination: A game-theoretic perspective

15 years 6 months ago

Download teamcore.usc.edu

Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...

Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus

CSB
2002
IEEE

109 views, Bioinformatics

Towards Automatic Clustering of Protein Sequences

16 years 17 days ago

Download www.cs.unc.edu

Analyzing protein sequence data becomes increasingly important recently. Most previous work on this area has mainly focused on building classification models. In this paper, we i...

Jiong Yang, Wei Wang 0010

NIPS
2008

271 views, Information Technology

Goal-directed decision making in prefrontal cortex: a computational framework

15 years 9 months ago

Download www.princeton.edu

Research in animal learning and behavioral neuroscience has distinguished between two forms of action control: a habit-based form, which relies on stored action values, and a goal...

Matthew Botvinick, James An
