Search Sciweavers | Sciweavers

4544 search results - page 105 / 909

» Reinforcement Learning with Time

162

click to vote

JSW
2007

112views more JSW 2007»

The Challenge of Training New Architects: an Ontological and Reinforcement-Learning Methodology

15 years 6 months ago

Download www.academypublisher.com

— This paper describes the importance of new skilled architects in the discipline of Software and Enterprise Architecture. Architects are often idealized as super heroes with a l...

Anabel Fraga, Juan Lloréns

claim paper

Read More »

168

click to vote

NN
2002
Springer

113views Neural Networks» more NN 2002»

Control of exploitation-exploration meta-parameter in reinforcement learning

15 years 6 months ago

Download www.fil.ion.ucl.ac.uk

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto

claim paper

Read More »

157

click to vote

ACL
2009

123views Computational Linguistics» more ACL 2009»

Reinforcement Learning for Mapping Instructions to Actions

15 years 4 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

claim paper

Read More »

187

click to vote

ICASSP
2011
IEEE

204views Signal Processing» more ICASSP 2011»

Bayesian reinforcement learning for POMDP-based dialogue systems

14 years 10 months ago

Download mirlab.org

Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...

ShaoWei Png, Joelle Pineau

claim paper

Read More »

149

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Standard and averaging reinforcement learning in XCS

15 years 10 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

claim paper

Read More »

« Prev « First page 105 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers