Search Sciweavers | Sciweavers

509 search results - page 33 / 102

» Compositional Models for Reinforcement Learning

199

click to vote

JSAC
2010

107views more JSAC 2010»

Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

15 years 5 months ago

Download medianetlab.ee.ucla.edu

Abstract—In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...

Hsien-Po Shiang, Mihaela van der Schaar

claim paper

Read More »

192

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

16 years 1 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

177

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 8 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

187

click to vote

RAS
2000

161views more RAS 2000»

Active object recognition by view integration and reinforcement learning

15 years 6 months ago

Download www.emt.tu-graz.ac.at

A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...

Lucas Paletta, Axel Pinz

claim paper

Read More »

185

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

« Prev « First page 33 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers