Search Sciweavers | Sciweavers

4544 search results - page 55 / 909

» Reinforcement Learning with Time

NAACL
2001

Computational Linguistics

Learning Optimal Dialogue Management Rules by Using Reinforcement Learning and Inductive Logic Programming

Download www.aclweb.org

Developing dialogue systems is a complex process. In particular, designing efficient dialogue management strategies is often difficult as there are no precise guidelines to develo...

Renaud Lecoeuche

NIPS
1997

Information Technology

Nonparametric Model-Based Reinforcement Learning

Download www.cs.cmu.edu

This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses...

Christopher G. Atkeson

ICML
2009
IEEE

Machine Learning

Model-free reinforcement learning as mixture learning

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

ICMLA
2008

Machine Learning

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

ICML
2004
IEEE

Machine Learning

Apprenticeship learning via inverse reinforcement learning

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng
