Sciweavers

ICANN
2009
Springer
14 years 16 days ago
Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data
In a typical reinforcement learning (RL) setting, details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...
Alexander Hans, Steffen Udluft
CSE
2009
IEEE
13 years 12 months ago
Self-Adaptation of Fault Tolerance Requirements Using Contracts
Fault tolerance is a constant concern in data centers where servers have to run with a minimal level of failures. Changes in the operating conditions or in server demands, and var...
André Luiz B. Rodrigues, Leila N. Bezerra, ...
AIPS
2009
13 years 9 months ago
A Human-Aware Robot Task Planner
The growing presence of household robots in inhabited environments gives rise to the need for new robot task planning techniques. These techniques should take into consideration not only...
Marcello Cirillo, Lars Karlsson, Alessandro Saffio...
ACL
2009
13 years 6 months ago
Reinforcement Learning for Mapping Instructions to Actions
In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...
S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...
CDC
2009
IEEE
13 years 6 months ago
Stochastic optimization for Markov modulated networks with application to delay constrained wireless scheduling
We consider a wireless system with a small number of delay constrained users and a larger number of users without delay constraints. We develop a scheduling algorithm th...
Michael J. Neely