Search Sciweavers | Sciweavers

1233 search results - page 188 / 247

» Feudal Reinforcement Learning

187

click to vote

SIGGRAPH
2010
ACM

248views Computer Graphics» more SIGGRAPH 2010»

Gesture controllers

15 years 10 months ago

Download graphics.stanford.edu

We introduce gesture controllers, a method for animating the body language of avatars engaged in live spoken conversation. A gesture controller is an optimal-policy controller tha...

Sergey Levine, Philipp Krähenbühl, Sebastian Thr...

claim paper

Read More »

142

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 5 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

172

Voted

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 3 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

198

click to vote

JMLR
2010

141views more JMLR 2010»

Pinview: Implicit Feedback in Content-Based Image Retrieval

15 years 19 days ago

Download jmlr.csail.mit.edu

This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...

Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...

claim paper

Read More »

155

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 9 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

« Prev « First page 188 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers