Search Sciweavers | Sciweavers

» Off-Policy Temporal Difference Learning with Function Approx...

IJCAI
2007

173 views, Artificial Intelligence

Reinforcement Learning of Local Shape in the Game of Go

15 years 8 months ago

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

ICMLA
2007

92 views, Machine Learning

Control of a re-entrant line manufacturing model with a reinforcement learning approach

15 years 8 months ago

Download www.smitlab.uc.edu

This paper presents the application of a reinforcement learning (RL) approach for the near-optimal control of a re-entrant line manufacturing (RLM) model. The RL approach utilizes...

José A. Ramírez-Hernández, Em...

ATAL
2009
Springer

135 views, Intelligent Agents

An empirical analysis of value function-based and policy search reinforcement learning

16 years 1 month ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

ICML
2007
IEEE

180 views, Machine Learning

Bayesian actor-critic algorithms

16 years 7 months ago

Download www.machinelearning.org

We present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning, is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

ML
2007
ACM

106 views, Machine Learning

Surrogate maximization/minimization algorithms and extensions

15 years 6 months ago

Download www.cs.ust.hk

Surrogate maximization (or minimization) (SM) algorithms are a family of algorithms that can be regarded as a generalization of expectation-maximization (EM) algorithms. A...

Zhihua Zhang, James T. Kwok, Dit-Yan Yeung
