Search Sciweavers | Sciweavers

133 search results - page 16 / 27

» Hierarchical Policy Gradient Algorithms


SDM
2012
SIAM

281 views, Data Mining

Contextual Collaborative Filtering via Hierarchical Matrix Factorization

12 years 7 days ago

Download www.cse.ust.hk

Matrix factorization (MF) has been demonstrated to be one of the most competitive techniques for collaborative filtering. However, state-of-the-art MFs do not consider contextual...

ErHeng Zhong, Wei Fan, Qiang Yang


GECCO
2009
Springer

162 views, Optimization

Uncertainty handling CMA-ES for reinforcement learning

13 years 7 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel


ICPPW
2000
IEEE

79 views, Distributed And Parallel Com...

Reducing Web Latency with Hierarchical Cache-Based Prefetching

14 years 2 months ago

Download www.dennis-strelow.com

Proxy caches have become a central mechanism for reducing the latency of web document retrieval. While caching alone reduces latency for previously requested documents, web docume...

Dan Foygel, Dennis Strelow


ATAL
2007
Springer

141 views, Intelligent Agents

Commitment-driven distributed joint policy search

14 years 4 months ago

Download www-personal.umich.edu

Decentralized MDPs provide powerful models of interactions in multi-agent environments, but are often very difficult or even computationally infeasible to solve optimally. Here we...

Stefan J. Witwicki, Edmund H. Durfee


NIPS
2007

142 views, Information Technology

Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion

13 years 11 months ago

Download books.nips.cc

We consider apprenticeship learning—learning from expert demonstrations—in the setting of large, complex domains. Past work in apprenticeship learning requires that the expert...

J. Zico Kolter, Pieter Abbeel, Andrew Y. Ng

