Search Sciweavers | Sciweavers

51 search results - page 5 / 11

» Characterizing reinforcement learning methods through parame...


GECCO
2008
Springer


Scaling ant colony optimization with hierarchical reinforcement learning partitioning

15 years 7 months ago

Download www.cs.bham.ac.uk

This paper merges hierarchical reinforcement learning (HRL) with ant colony optimization (ACO) to produce a HRL ACO algorithm capable of generating solutions for large domains. Th...

Erik J. Dries, Gilbert L. Peterson


TMM
2010


Video Annotation Through Search and Graph Reinforcement Mining

15 years 1 months ago

Download vision.ece.ucsb.edu

Abstract: Unlimited vocabulary annotation of multimedia documents remains elusive despite progress solving the problem in the case of a small, fixed lexicon. Taking advantage of th...

Emily Moxley, Tao Mei, Bangalore S. Manjunath


ATAL
2009
Springer


An empirical analysis of value function-based and policy search reinforcement learning

16 years 1 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone


ICMLA
2010


Multimodal Parameter-exploring Policy Gradients

15 years 4 months ago

Download www6.in.tum.de

Abstract: Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...


ECCV
2002
Springer


Multimodal Data Representations with Parameterized Local Structures

16 years 8 months ago

Download www.caip.rutgers.edu

Abstract. In many vision problems, the observed data lies in a nonlinear manifold in a high-dimensional space. This paper presents a generic modelling scheme to characterize the no...

Ying Zhu, Dorin Comaniciu, Stuart C. Schwartz, Vis...
