Search Sciweavers | Sciweavers



ICMLA
2008


Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 8 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux


NECO
2010


Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 5 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...


KES
2007
Springer


Making Financial Trading by Recurrent Reinforcement Learning

16 years 1 month ago

Download www.sms.dsems.unile.it

In this paper we propose a financial trading system whose strategy is developed by means of an artificial neural network approach based on a recurrent reinforcement learning algo...

Francesco Bertoluzzo, Marco Corazza


ATAL
2010
Springer


Using spatial hints to improve policy reuse in a reinforcement learning agent

15 years 8 months ago

Download www.aamas-conference.org

Bruno Norberto da Silva, Alan K. Mackworth


INTERSPEECH
2010


Still talking to machines (cognitively speaking)

15 years 2 months ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young
