Search Sciweavers | Sciweavers

1233 search results - page 82 / 247

» Feudal Reinforcement Learning

165

click to vote

ICMLA
2004

109views Machine Learning» more ICMLA 2004»

Variable resolution discretization in the joint space

15 years 7 months ago

Download highentropy.com

We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...

Christopher K. Monson, David Wingate, Kevin D. Sep...

claim paper

Read More »

184

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 7 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

178

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 4 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

163

click to vote

ITNG
2007
IEEE

118views Information Technology» more ITNG 2007»

Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals

16 years 13 days ago

Download eprints.qut.edu.au

This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...

Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...

claim paper

Read More »

171

click to vote

ATAL
2003
Springer

176views Intelligent Agents» more ATAL 2003»

A selection-mutation model for q-learning in multi-agent systems

15 years 11 months ago

Download www.personeel.unimaas.nl

Although well understood in the single-agent framework, the use of traditional reinforcement learning (RL) algorithms in multi-agent systems (MAS) is not always justiﬁed. The fe...

Karl Tuyls, Katja Verbeeck, Tom Lenaerts

claim paper

Read More »

« Prev « First page 82 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers