Search Sciweavers | Sciweavers

995 search results - page 19 / 199

» Learning Useful Horn Approximations

click to vote

GECCO
2008
Springer

170views Optimization» more GECCO 2008»

Evolving prediction weights using evolution strategy

13 years 11 months ago

Download www.cs.bham.ac.uk

The evolution strategy is one of the strongest evolutionary algorithms for optimizing real-value vectors. In this paper, we study how to use it for the evolution of prediction wei...

Trung Hau Tran, Cédric Sanza, Yves Duthen

claim paper

Read More »

click to vote

ICML
2008
IEEE

165views Machine Learning» more ICML 2008»

A worst-case comparison between temporal difference and residual gradient with linear function approximation

14 years 10 months ago

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

claim paper

Read More »

click to vote

ICPR
2008
IEEE

173views Computer Vision» more ICPR 2008»

Solving quadratically constrained geometrical problems using lagrangian duality

14 years 11 months ago

Download www.maths.lth.se

In this paper we consider the problem of solving different pose and registration problems under rotational constraints. Traditionally, methods such as the iterative closest point ...

Carl Olsson, Anders Eriksson

claim paper

Read More »

click to vote

CCIA
2005
Springer

117views Artificial Intelligence» more CCIA 2005»

Direct Policy Search Reinforcement Learning for Robot Control

14 years 3 months ago

Download vicorob.udg.es

— This paper proposes a high-level Reinforcement Learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, whe...

Andres El-Fakdi, Marc Carreras, Narcís Palo...

claim paper

Read More »

click to vote

AAAI
2006

142views Intelligent Agents» more AAAI 2006»

Learning Basis Functions in Hybrid Domains

13 years 11 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

« Prev « First page 19 / 199 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers