Search Sciweavers | Sciweavers

473 search results - page 59 / 95

» Optimal policy switching algorithms for reinforcement learni...

ML 1998, ACM

Colearning in Differential Games

15 years 5 months ago

Download www.cs.jhu.edu

Game playing has been a popular problem area for research in artificial intelligence and machine learning for many years. In almost every study of game playing and machine learnin...

John W. Sheppard

GECCO 2004, Springer

Genetic Network Programming with Reinforcement Learning and Its Performance Evaluation

15 years 11 months ago

Download www.cs.york.ac.uk

A new graph-based evolutionary algorithm named “Genetic Network Programming, GNP” has been proposed. GNP represents its solutions as directed graph structures, which can improv...

Shingo Mabu, Kotaro Hirasawa, Jinglu Hu

ICML 2003, IEEE

The Cross Entropy Method for Fast Policy Search

16 years 6 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

ICANN 2010, Springer

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 5 months ago

Download www.idsia.ch

Developing superior artificial board-game players is a widely studied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...

ACL 2008

Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation

15 years 7 months ago

Download www.aclweb.org

We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...

Verena Rieser, Oliver Lemon
