Search Sciweavers | Sciweavers

165 search results - page 20 / 33

» Exploration and apprenticeship learning in reinforcement lea...

ICDCSW
2006
IEEE

133 views, Computer Networks

Improve Searching by Reinforcement Learning in Unstructured P2Ps

14 years 3 months ago

Download www.cse.fau.edu

Existing searching schemes in unstructured P2Ps can be categorized as either blind or informed. The quality of query results in blind schemes is low. Informed schemes use simpl...

Xiuqi Li, Jie Wu

ECML
2004
Springer

77 views, Machine Learning

Filtered Reinforcement Learning

14 years 2 months ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

IJCNN
2006
IEEE

121 views, Neural Networks

Learning a Rendezvous Task with Dynamic Joint Action Perception

14 years 3 months ago

Download axon.cs.byu.edu

Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

ICML
2003
AAAI Press

124 views, Machine Learning

Exploration in Metric State Spaces

14 years 9 months ago

Download www.cis.upenn.edu

We present metric-E^3, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

ICRA
2008
IEEE

173 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

14 years 3 months ago

Download www.cs.cmu.edu

We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
