Search Sciweavers | Sciweavers

146

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 5 months ago

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

138

click to vote

JUCS
2007

98views more JUCS 2007»

Focus of Attention in Reinforcement Learning

15 years 4 months ago

Download www.research.rutgers.edu

Abstract: Classiﬁcation-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

117

click to vote

MANSCI
2007

100views more MANSCI 2007»

Dynamic Assortment with Demand Learning for Seasonal Consumer Goods

15 years 4 months ago

Download web.mit.edu

Companies such as Zara and World Co. have recently implemented novel product development processes and supply chain architectures enabling them to make more product design and ass...

Felipe Caro, Jérémie Gallien

claim paper

Read More »

182

click to vote

MMS
2002

171views Information Technology» more MMS 2002»

Dynamic end-to-end QoS management middleware for distributed multimedia systems

15 years 4 months ago

Download www-itec.uni-klu.ac.at

Abstract. In this paper, we present a separable, reusable middleware solution that provides coordinated, end-to-end QoS management over any type of service component, and can use e...

Denise J. Ecklund, Vera Goebel, Thomas Plagemann, ...

claim paper

Read More »

117

click to vote

ATAL
2007
Springer

81views Intelligent Agents» more ATAL 2007»

Multiagent learning in adaptive dynamic systems

15 years 10 months ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers