Search Sciweavers | Sciweavers

651 search results - page 47 / 131

» Algorithms for Inverse Reinforcement Learning

158

click to vote

CONSTRAINTS
2008

89views more CONSTRAINTS 2008»

A Reinforcement Learning Approach to Interval Constraint Propagation

15 years 6 months ago

Download www.crt.umontreal.ca

When solving systems of nonlinear equations with interval constraint methods, it has often been observed that many calls to contracting operators do not participate actively to th...

Frédéric Goualard, Christophe Jerman...

claim paper

Read More »

183

click to vote

ICASSP
2011
IEEE

204views Signal Processing» more ICASSP 2011»

Bayesian reinforcement learning for POMDP-based dialogue systems

14 years 10 months ago

Download mirlab.org

Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...

ShaoWei Png, Joelle Pineau

claim paper

Read More »

162

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

16 years 10 days ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

153

click to vote

NECO
2010

103views more NECO 2010»

Posterior Weighted Reinforcement Learning with State Uncertainty

15 years 4 months ago

Download www.maths.bris.ac.uk

Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn f...

Tobias Larsen, David S. Leslie, Edmund J. Collins,...

claim paper

Read More »

178

click to vote

ICML
2005
IEEE

93views Machine Learning» more ICML 2005»

Relating reinforcement learning performance to classification performance

16 years 7 months ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

claim paper

Read More »

« Prev « First page 47 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers