Sciweavers

1167 search results - page 114 / 234
» policy 2007
Sort
View
NIPS
1993
15 years 5 months ago
Robust Reinforcement Learning in Motion Planning
While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...
Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...
JUCS
2007
98views more  JUCS 2007»
15 years 4 months ago
Focus of Attention in Reinforcement Learning
Abstract: Classification-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...
Lihong Li, Vadim Bulitko, Russell Greiner
MANSCI
2007
100views more  MANSCI 2007»
15 years 4 months ago
Dynamic Assortment with Demand Learning for Seasonal Consumer Goods
Companies such as Zara and World Co. have recently implemented novel product development processes and supply chain architectures enabling them to make more product design and ass...
Felipe Caro, Jérémie Gallien
MMS
2002
15 years 4 months ago
Dynamic end-to-end QoS management middleware for distributed multimedia systems
Abstract. In this paper, we present a separable, reusable middleware solution that provides coordinated, end-to-end QoS management over any type of service component, and can use e...
Denise J. Ecklund, Vera Goebel, Thomas Plagemann, ...
ATAL
2007
Springer
15 years 10 months ago
Multiagent learning in adaptive dynamic systems
Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...
Andriy Burkov, Brahim Chaib-draa