Sciweavers

369 search results - page 34 / 74
» Global Optimization for Value Function Approximation
Sort
View
EC
2011
240views ECommerce» more  EC 2011»
13 years 3 months ago
HypE: An Algorithm for Fast Hypervolume-Based Many-Objective Optimization
Abstract—In the field of evolutionary multi-criterion optimization, the hypervolume indicator is the only single set quality measure that is known to be strictly monotonic with ...
Johannes Bader, Eckart Zitzler
PKDD
2009
Springer
169views Data Mining» more  PKDD 2009»
14 years 3 months ago
Hybrid Least-Squares Algorithms for Approximate Policy Evaluation
The goal of approximate policy evaluation is to “best” represent a target value function according to a specific criterion. Temporal difference methods and Bellman residual m...
Jeffrey Johns, Marek Petrik, Sridhar Mahadevan
IJRR
2007
104views more  IJRR 2007»
13 years 8 months ago
Deep Space Formation Flying Spacecraft Path Planning
Efficient algorithms for collision-free energy sub-optimal path planning for formations of spacecraft flying in deep space are presented. The idea is to introduce a set of way-poi...
Cornel Sultan, Sanjeev Seereeram, Raman K. Mehra
NIPS
2001
13 years 10 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
ICRA
2002
IEEE
120views Robotics» more  ICRA 2002»
14 years 1 months ago
Workspace Optimization of 3-UPU Parallel Platforms with Joint Constraints
In this paper the workspace optimization of translational 3-UPU parallel platforms with prismatic and universal joint constraints is performed. The workspace is parameterized usin...
Mircea Badescu, Jeremy Morman, Constantinos Mavroi...