Search Sciweavers | Sciweavers

166 search results - page 19 / 34

» Safe exploration for reinforcement learning

166

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

155

click to vote

ICDCSW
2006
IEEE

133views Computer Networks» more ICDCSW 2006»

Improve Searching by Reinforcement Learning in Unstructured P2Ps

16 years 7 days ago

Download www.cse.fau.edu

— Existing searching schemes in unstructured P2Ps can be categorized as either blind or informed. The quality of query results in blind schemes is low. Informed schemes use simpl...

Xiuqi Li, Jie Wu

claim paper

Read More »

142

click to vote

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

15 years 11 months ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

171

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 19 days ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

224

click to vote

WAPCV
2007
Springer

188views Computer Vision» more WAPCV 2007»

Reinforcement Learning for Decision Making in Sequential Visual Attention

16 years 9 days ago

Download www.mobvis.org

The innovation of this work is the provision of a system that learns visual encodings of attention patterns and that enables sequential attention for object detection in real world...

Lucas Paletta, Gerald Fritz

claim paper

Read More »

« Prev « First page 19 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers