Search Sciweavers | Sciweavers

87 search results - page 7 / 18

» Hybrid Least-Squares Algorithms for Approximate Policy Evalu...


ECAI
2008
Springer

124 views, Artificial Intelligence

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

13 years 9 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo


ICCV
2009
IEEE

309 views, Computer Vision

Local distance functions: A taxonomy, new algorithms, and an evaluation

13 years 5 months ago

Download www.ics.uci.edu

We present a taxonomy for local distance functions where most existing algorithms can be regarded as approximations of the geodesic distance defined by a metric tensor. We categor...

Deva Ramanan, Simon Baker


CDC
2010
IEEE

139 views, Control Systems

Q-learning and enhanced policy iteration in discounted dynamic programming

13 years 2 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu


ICRA
2008
IEEE

169 views, Robotics

Sparse incremental learning for interactive robot control policy estimation

14 years 2 months ago

Download www.cs.brown.edu

We are interested in transferring control policies for arbitrary tasks from a human to a robot. Using interactive demonstration via teleoperation as our transfer scenario, we ca...

Daniel H. Grollman, Odest Chadwicke Jenkins


GECCO
2007
Springer

173 views, Optimization

A hybrid GA for a supply chain production planning problem

14 years 2 months ago

Download www.cs.bham.ac.uk

The problem of production and delivery lot-sizing and scheduling of a set of items in a two-echelon supply chain over a finite planning horizon is addressed in this paper. A single ...

Masoud Jenabi, S. Ali Torabi, S. Afshin Mansouri

