Search Sciweavers | Sciweavers

69 search results - page 11 / 14

» PAC-Bayesian Policy Evaluation for Reinforcement Learning

153

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 7 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

173

click to vote

ACL
2008

127views Computational Linguistics» more ACL 2008»

Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation

15 years 7 months ago

Download www.aclweb.org

We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...

Verena Rieser, Oliver Lemon

claim paper

Read More »

130

click to vote

ESANN
2008

123views Neural Networks» more ESANN 2008»

Safe exploration for reinforcement learning

15 years 7 months ago

Download ahans.de

In this paper we define and address the problem of safe exploration in the context of reinforcement learning. Our notion of safety is concerned with states or transitions that can ...

Alexander Hans, Daniel Schneegaß, Anton Maxi...

claim paper

Read More »

185

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 6 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

170

click to vote

PKDD
2009
Springer

152views Data Mining» more PKDD 2009»

Feature Selection for Value Function Approximation Using Bayesian Model Selection

16 years 12 days ago

Download userweb.cs.utexas.edu

Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...

Tobias Jung, Peter Stone

claim paper

Read More »

« Prev « First page 11 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers