Sciweavers

163 search results - page 25 / 33
» Policy Gradient Methods for Robotics
Sort
View
NIPS
2008
13 years 9 months ago
Fitted Q-iteration by Advantage Weighted Regression
Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...
Gerhard Neumann, Jan Peters
ICRA
2003
IEEE
128views Robotics» more  ICRA 2003»
14 years 1 months ago
Avoiding unsafe states in manufacturing systems based on polynomial digraph algorithms
Abstract − A deadlock-free unsafe (DFU) state of Resource Allocation System (RAS) is deadlock-free but inevitable to enter a deadlock state. Previous research revealed that in ma...
Yin Wang, Zhiming Wu
AROBOTS
2007
128views more  AROBOTS 2007»
13 years 8 months ago
Visual homing in environments with anisotropic landmark distribution
Gradient descent in image distances can lead a navigating agent to the goal location, but in environments with an anisotropic distribution of landmarks, gradient home vectors devia...
Ralf Möller, Andrew Vardy, Sven Kreft, Sebast...
BC
2006
132views more  BC 2006»
13 years 8 months ago
Local visual homing by matched-filter descent in image distances
Abstract In natural images, the distance measure between two images taken at different locations rises smoothly with increasing distance between the locations. This fact can be exp...
Ralf Möller, Andrew Vardy
ICRA
2010
IEEE
130views Robotics» more  ICRA 2010»
13 years 6 months ago
Multi-robot coordination with periodic connectivity
Abstract— We consider the problem of multi-robot coordination subject to constraints on the configuration. Specifically, we examine the case in which a mobile network of robots...
Geoffrey Hollinger, Sanjiv Singh