Search Sciweavers | Sciweavers

9 search results - page 1 / 2

» Monte-Carlo Go Reinforcement Learning Experiments

179

click to vote

CIG
2006
IEEE

190views Applied Computing» more CIG 2006»

Monte-Carlo Go Reinforcement Learning Experiments

16 years 21 days ago

Download www.math-info.univ-paris5.fr

Abstract— This paper describes experiments using reinforcement learning techniques to compute pattern urgencies used during simulations performed in a Monte-Carlo Go architecture...

Bruno Bouzy, Guillaume Chaslot

claim paper

Read More »

160

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

16 years 7 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

192

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 7 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

200

click to vote

ICRA
2009
IEEE

138views Robotics» more ICRA 2009»

Which landmark is useful? Learning selection policies for navigation in unknown environments

16 years 1 months ago

Download europa.informatik.uni-freiburg.de

Abstract— In general, a mobile robot that operates in unknown environments has to maintain a map and has to determine its own location given the map. This introduces signiﬁcant...

Hauke Strasdat, Cyrill Stachniss, Wolfram Burgard

claim paper

Read More »

201

click to vote

ACG
2003
Springer

157views Computer Graphics» more ACG 2003»

Evaluation in Go by a Neural Network using Soft Segmentation

15 years 12 months ago

Download webdocs.cs.ualberta.ca

In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position ev...

Markus Enzenberger

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers