Search Sciweavers | Sciweavers

168

ICANN
2009
Springer

123views Neural Networks» more ICANN 2009»

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

15 years 10 months ago

In a typical reinforcement learning (RL) setting details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft

claim paper

Read More »

160

click to vote

CSE
2009
IEEE

149views Theoretical Computer Science» more CSE 2009»

Self-Adaptation of Fault Tolerance Requirements Using Contracts

15 years 9 months ago

Download www.tempo.uff.br

Fault tolerance is a constant concern in data centers where servers have to run with a minimal level of failures. Changes on the operating conditions or on server demands, and var...

André Luiz B. Rodrigues, Leila N. Bezerra, ...

claim paper

Read More »

155

click to vote

AIPS
2009

139views Artificial Intelligence» more AIPS 2009»

A Human-Aware Robot Task Planner

15 years 7 months ago

Download aass.oru.se

The growing presence of household robots in inhabited environments arises the need for new robot task planning techniques. These techniques should take into consideration not only...

Marcello Cirillo, Lars Karlsson, Alessandro Saffio...

claim paper

Read More »

144

click to vote

ACL
2009

123views Computational Linguistics» more ACL 2009»

Reinforcement Learning for Mapping Instructions to Actions

15 years 3 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

claim paper

Read More »

269

click to vote

CDC
2009
IEEE

126views Control Systems» more CDC 2009»

Stochastic optimization for Markov modulated networks with application to delay constrained wireless scheduling

15 years 3 months ago

Download www-bcf.usc.edu

Abstract-- We consider a wireless system with a small number of delay constrained users and a larger number of users without delay constraints. We develop a scheduling algorithm th...

Michael J. Neely

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers