Search Sciweavers | Sciweavers

226 search results - page 36 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

CACM
2010

105views more CACM 2010»

Censored exploration and the dark pool problem

13 years 7 months ago

Download www.cis.upenn.edu

We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove t...

Kuzman Ganchev, Yuriy Nevmyvaka, Michael Kearns, J...

claim paper

Read More »

click to vote

SIGDIAL
2010

158views Natural Language Processing» more SIGDIAL 2010»

Sparse Approximate Dynamic Programming for Dialog Management

13 years 5 months ago

Download www.sigdial.org

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...

Senthilkumar Chandramohan, Matthieu Geist, Olivier...

claim paper

Read More »

click to vote

CORR
2006
Springer

140views Education» more CORR 2006»

Nearly optimal exploration-exploitation decision thresholds

13 years 7 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

13 years 2 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

« Prev « First page 36 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers