Search Sciweavers | Sciweavers

183

CSL
2010
Springer

163views Automated Reasoning» more CSL 2010»

Evaluation of a hierarchical reinforcement learning spoken dialogue system

15 years 6 months ago

We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment a...

Heriberto Cuayáhuitl, Steve Renals, Oliver ...

claim paper

Read More »

167

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 4 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

146

click to vote

CEC
2008
IEEE

116views Artificial Intelligence» more CEC 2008»

Creating edge detectors by evolutionary reinforcement learning

16 years 14 days ago

Download www.ks.informatik.uni-kiel.de

— In this article we present results from experiments where a edge detector was learned from scratch by EANT2, a method for evolutionary reinforcement learning. The detector is c...

Nils T. Siebel, Sven Grünewald, Gerald Sommer

claim paper

Read More »

149

click to vote

IROS
2007
IEEE

144views Robotics» more IROS 2007»

Using reinforcement learning to adapt an imitation task

16 years 9 days ago

Download lasa.epfl.ch

Abstract— The goal of developing algorithms for programming robots by demonstration is to create an easy way of programming robots that can be accomplished by everyone. When a de...

Florent Guenter, Aude Billard

claim paper

Read More »

149

click to vote

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 11 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers