Search Sciweavers | Sciweavers

779 search results - page 11 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

118

Voted

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 3 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

118

click to vote

ACL
2008

127views Computational Linguistics» more ACL 2008»

Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation

15 years 3 months ago

Download www.aclweb.org

We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...

Verena Rieser, Oliver Lemon

claim paper

Read More »

136

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

15 years 11 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

129

click to vote

IJAIT
2008

146views more IJAIT 2008»

Learning to Behave in Space: a Qualitative Spatial Representation for Robot Navigation with Reinforcement Learning

15 years 2 months ago

Download www.aussagekraft.de

ion mechanism to create a representation of space consisting of the circular order of detected landmarks and the relative position of walls towards the agent's moving directio...

Lutz Frommberger

claim paper

Read More »

135

click to vote

ICRA
2009
IEEE

138views Robotics» more ICRA 2009»

Which landmark is useful? Learning selection policies for navigation in unknown environments

15 years 9 months ago

Download europa.informatik.uni-freiburg.de

Abstract— In general, a mobile robot that operates in unknown environments has to maintain a map and has to determine its own location given the map. This introduces signiﬁcant...

Hauke Strasdat, Cyrill Stachniss, Wolfram Burgard

claim paper

Read More »

« Prev « First page 11 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers