Sciweavers

651 search results - page 73 / 131
» Algorithms for Inverse Reinforcement Learning
Sort
View
CIKM
2000
Springer
14 years 2 months ago
Relevance and Reinforcement in Interactive Browsing
We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...
Anton Leuski
IJCAI
2001
13 years 11 months ago
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso
ICRA
2010
IEEE
153views Robotics» more  ICRA 2010»
13 years 8 months ago
Learning to navigate through crowded environments
— The goal of this research is to enable mobile robots to navigate through crowded environments such as indoor shopping malls, airports, or downtown side walks. The key research ...
Peter Henry, Christian Vollmer, Brian Ferris, Diet...
NIPS
2003
13 years 11 months ago
Approximate Planning in POMDPs with Macro-Actions
Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...
Georgios Theocharous, Leslie Pack Kaelbling
CORR
2012
Springer
216views Education» more  CORR 2012»
12 years 6 months ago
Fractional Moments on Bandit Problems
Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...
Ananda Narayanan B., Balaraman Ravindran