Search Sciweavers | Sciweavers

651 search results - page 73 / 131

» Algorithms for Inverse Reinforcement Learning


CIKM
2000
Springer

104views Information Technology» more CIKM 2000»

Relevance and Reinforcement in Interactive Browsing

15 years 10 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski


IJCAI
2001

119views Artificial Intelligence» more IJCAI 2001»

Rational and Convergent Learning in Stochastic Games

15 years 7 months ago

Download reference.kfupm.edu.sa

This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...

Michael H. Bowling, Manuela M. Veloso


ICRA
2010
IEEE

153views Robotics» more ICRA 2010»

Learning to navigate through crowded environments

15 years 4 months ago

Download www.cs.washington.edu

The goal of this research is to enable mobile robots to navigate through crowded environments such as indoor shopping malls, airports, or downtown sidewalks. The key research ...

Peter Henry, Christian Vollmer, Brian Ferris, Diet...


NIPS
2003

148views Information Technology» more NIPS 2003»

Approximate Planning in POMDPs with Macro-Actions

15 years 7 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling


CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 1 month ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

