Search Sciweavers | Sciweavers

194

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 8 months ago

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

204

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

271

click to vote

TMM
2010

199views Management» more TMM 2010»

Video Annotation Through Search and Graph Reinforcement Mining

15 years 2 months ago

Download vision.ece.ucsb.edu

Abstract--Unlimited vocabulary annotation of multimedia documents remains elusive despite progress solving the problem in the case of a small, fixed lexicon. Taking advantage of th...

Emily Moxley, Tao Mei, Bangalore S. Manjunath

claim paper

Read More »

187

click to vote

CCGRID
2008
IEEE

127views Distributed And Parallel Com...» more CCGRID 2008»

Grid Differentiated Services: A Reinforcement Learning Approach

16 years 1 months ago

Download hal.inria.fr

—Large scale production grids are a major case for autonomic computing. Following the classical deﬁnition of Kephart, an autonomic computing system should optimize its own beha...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

176

click to vote

FUZZIEEE
2007
IEEE

132views Fuzzy Logic» more FUZZIEEE 2007»

Fuzzy Approximation for Convergent Model-Based Reinforcement Learning

16 years 1 months ago

Download www.montefiore.ulg.ac.be

— Reinforcement learning (RL) is a learning control paradigm that provides well-understood algorithms with good convergence and consistency properties. Unfortunately, these algor...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers