Search Sciweavers | Sciweavers

827 search results - page 60 / 166

» Variational methods for Reinforcement Learning

194

click to vote

ROBOCUP
2007
Springer

167views Robotics» more ROBOCUP 2007»

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

16 years 1 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada

claim paper

Read More »

163

click to vote

JMLR
2010

129views more JMLR 2010»

Efficient Multioutput Gaussian Processes through Variational Inducing Kernels

15 years 1 months ago

Download jmlr.csail.mit.edu

Interest in multioutput kernel methods is increasing, whether under the guise of multitask learning, multisensor networks or structured output data. From the Gaussian process pers...

Mauricio Alvarez, David Luengo, Michalis Titsias, ...

claim paper

Read More »

201

Voted

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 9 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

264

Voted

TMM
2010

199views Management» more TMM 2010»

Video Annotation Through Search and Graph Reinforcement Mining

15 years 1 months ago

Download vision.ece.ucsb.edu

Abstract--Unlimited vocabulary annotation of multimedia documents remains elusive despite progress solving the problem in the case of a small, fixed lexicon. Taking advantage of th...

Emily Moxley, Tao Mei, Bangalore S. Manjunath

claim paper

Read More »

170

click to vote

ICML
2009
IEEE

134views Machine Learning» more ICML 2009»

Discovering options from example trajectories

16 years 8 months ago

Download www.cc.gatech.edu

We present a novel technique for automated problem decomposition to address the problem of scalability in reinforcement learning. Our technique makes use of a set of near-optimal ...

Peng Zang, Peng Zhou, David Minnen, Charles Lee Is...

claim paper

Read More »

« Prev « First page 60 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers