Search Sciweavers | Sciweavers

2566 search results - page 156 / 514

» Relating reinforcement learning performance to classificatio...

139

ICCBR
2010
Springer

274views Automated Reasoning» more ICCBR 2010»

Reducing the Memory Footprint of Temporal Difference Learning over Finitely Many States by Using Case-Based Generalization

15 years 8 months ago

Download www.cse.lehigh.edu

In this paper we present an approach for reducing the memory footprint requirement of temporal difference methods in which the set of states is finite. We use case-based generaliza...

Matt Dilts, Héctor Muñoz-Avila

claim paper

Read More »

138

click to vote

NIPS
1993

123views Information Technology» more NIPS 1993»

Temporal Difference Learning of Position Evaluation in the Game of Go

15 years 5 months ago

Download www.gatsby.ucl.ac.uk

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...

Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...

claim paper

Read More »

130

click to vote

AR
2002

157views more AR 2002»

Acquiring state from control dynamics to learn grasping policies for robot hands

15 years 4 months ago

Download www.mit.edu

Abstract--A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...

Roderic A. Grupen, Jefferson A. Coelho Jr.

claim paper

Read More »

131

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 8 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

claim paper

Read More »

151

click to vote

ECLIPSE
2007
ACM

143views Software Engineering» more ECLIPSE 2007»

Predicting buggy changes inside an integrated development environment

15 years 8 months ago

Download www.cs.mcgill.ca

We present a tool that predicts whether the software under development inside an IDE has a bug. An IDE plugin performs this prediction, using the Change Classification technique t...

Janaki T. Madhavan, E. James Whitehead Jr.

claim paper

Read More »

« Prev « First page 156 / 514 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers