Search Sciweavers | Sciweavers

50 search results - page 6 / 10

» Nonparametric Return Distribution Approximation for Reinforc...

121

click to vote

NIPS
2007

80views Information Technology» more NIPS 2007»

Stable Dual Dynamic Programming

15 years 3 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

116

Voted

KDD
2002
ACM

147views Data Mining» more KDD 2002»

Sequential cost-sensitive decision making with reinforcement learning

16 years 2 months ago

Download www.research.ibm.com

Recently, there has been increasing interest in the issues of cost-sensitive learning and decision making in a variety of applications of data mining. A number of approaches have ...

Edwin P. D. Pednault, Naoki Abe, Bianca Zadrozny

claim paper

Read More »

132

click to vote

CVPR
2008
IEEE

213views Computer Vision» more CVPR 2008»

Kernel-based learning of cast shadows from a physical model of light sources and surfaces for low-level segmentation

16 years 4 months ago

Download vision.gel.ulaval.ca

In background subtraction, cast shadows induce silhouette distortions and object fusions hindering performance of high level algorithms in scene monitoring. We introduce a nonpara...

André Zaccarin, Nicolas Martel-Brisson

claim paper

Read More »

116

Voted

ATAL
2008
Springer

131views Intelligent Agents» more ATAL 2008»

A new perspective to the keepaway soccer: the takers

15 years 4 months ago

Download www.aamas-conference.org

Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...

Atil Iscen, Umut Erogul

claim paper

Read More »

130

click to vote

IROS
2007
IEEE

168views Robotics» more IROS 2007»

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

15 years 8 months ago

Download www.cs.cmu.edu

Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

claim paper

Read More »

« Prev « First page 6 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers