Search Sciweavers | Sciweavers

473 search results - page 18 / 95

» Optimal policy switching algorithms for reinforcement learni...

DICTA
2007

132 views, Applied Computing

Fuzzy Model Based Recognition of Handwritten Hindi Characters

15 years 7 months ago

Download eprints.qut.edu.au

This paper presents the recognition of handwritten Hindi Characters based on the modified exponential membership function fitted to the fuzzy sets derived from features consisting...

Madasu Hanmandlu, O. V. Ramana Murthy, Vamsi Krish...

ICRA
2010
IEEE

143 views, Robotics

Apprenticeship learning via soft local homomorphisms

15 years 4 months ago

Download damas.ift.ulaval.ca

We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

ICML
2003
IEEE

151 views, Machine Learning

Hierarchical Policy Gradient Algorithms

16 years 6 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

NIPS
1994

152 views, Information Technology

Finding Structure in Reinforcement Learning

15 years 7 months ago

Download www.ri.cmu.edu

Reinforcement learning addresses the problem of learning to select actions in order to maximize one's performance in unknown environments. To scale reinforcement learning to com...

Sebastian Thrun, Anton Schwartz

NIPS
1996

117 views, Information Technology

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

15 years 7 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be cost-effective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...
