Sciweavers

1176 search results - page 7 / 236
» Sparse reward processes
Sort
View
114
Voted
CCIA
2009
Springer
15 years 4 months ago
Reward System for Completing FAQs
The creation of Answer Communities around a FAQs Site is proposed to speed up the process of answering questions. Our approach combines long-term and short-term rewards. Long-term ...
Araceli Moreno, Josep Lluís de la Rosa, Bol...
155
Voted
ARCS
2005
Springer
15 years 9 months ago
Adaptive Object Acquisition
We propose an active vision system for object acquisition. The core of our approach is a reinforcement learning module which learns a strategy to scan an object. The agent moves a...
Gabriele Peters, Claus-Peter Alberts, Markus Bries...
163
Voted
JMLR
2010
189views more  JMLR 2010»
14 years 10 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
156
Voted
FOCS
2007
IEEE
15 years 10 months ago
Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards
We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...
Sudipto Guha, Kamesh Munagala