Search Sciweavers | Sciweavers

» Variational methods for Reinforcement Learning

JMLR
2010

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 2 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

IROS
2008
IEEE

Robotics

Dynamic correlation matrix based multi-Q learning for a multi-robot system

16 years 1 month ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

IDEAL
2004
Springer

Intelligent Agents

Learning Users' Interests in a Market-Based Recommender System

16 years 18 days ago

Download eprints.ecs.soton.ac.uk

Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...

Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings

AOIS
2004

Intelligent Agents

Market-Based Recommender Systems: Learning Users' Interests by Quality Classification

15 years 8 months ago

Download eprints.ecs.soton.ac.uk

Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...

Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings

ICCV
2009
IEEE

Computer Vision

Learning Pedestrian Dynamics from the Real World

17 years 7 days ago

Download www.cs.ucf.edu

In this paper we describe a method to learn parameters which govern pedestrian motion by observing video data. Our learning framework is based on variational mode learning and a...

Paul Scovanner, Marshall Tappen
