Sciweavers

45 search results - page 7 / 9
» Policy Learning - A Unified Perspective with Applications in...
NIPS
2008
13 years 8 months ago
Fitted Q-iteration by Advantage Weighted Regression
Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...
Gerhard Neumann, Jan Peters
IJRR
2008
13 years 7 months ago
Learning to Control in Operational Space
One of the most general frameworks for phrasing control problems for complex, redundant robots is operational space control. However, while this framework is of essential importan...
Jan Peters, Stefan Schaal
AIIA
2007
Springer
14 years 1 months ago
Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions
The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...
Andrea Bonarini, Alessandro Lazaric, Marcello Rest...
WWW
2007
ACM
14 years 7 months ago
Exposing private information by timing web applications
We show that the time web sites take to respond to HTTP requests can leak private information, using two different types of attacks. The first, direct timing, directly measures re...
Andrew Bortz, Dan Boneh
IROS
2008
IEEE
14 years 1 months ago
Laban Movement Analysis for multi-ocular systems
We present as a contribution to the field of human-machine interaction a system that analyzes human movements online through multiple observers, based on the concept of...
Jörg Rett, Luis Santos, Jorge Dias