Search Sciweavers | Sciweavers

802 search results - page 161 / 161

» Experts in a Markov Decision Process

222

click to vote

CORR
2012
Springer

229views Education» more CORR 2012»

Cops and Invisible Robbers: the Cost of Drunkenness

14 years 3 months ago

Download www.math.ryerson.ca

We examine a version of the Cops and Robber (CR) game in which the robber is invisible, i.e., the cops do not know his location until they capture him. Apparently this game (CiR) h...

Athanasios Kehagias, Dieter Mitsche, Pawel Pralat

claim paper

Read More »

228

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 8 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 161 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers