Search Sciweavers | Sciweavers

The creation of Answer Communities around a FAQs Site is proposed to speed up the process of answering questions. Our approach combines long-term and short-term rewards. Long-term ...

Araceli Moreno, Josep Lluís de la Rosa, Bol...


ARCS
2005
Springer

Software Engineering

Adaptive Object Acquisition


We propose an active vision system for object acquisition. The core of our approach is a reinforcement learning module which learns a strategy to scan an object. The agent moves a...

Gabriele Peters, Claus-Peter Alberts, Markus Bries...


JMLR
2010


Adaptive Step-size Policy Gradients with Average Reward Metric


In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...


FOCS
2007
IEEE

Theoretical Computer Science

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards


We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala
