Search Sciweavers | Sciweavers

45 search results - page 7 / 9

» Policy Learning - A Unified Perspective with Applications in...

148

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Fitted Q-iteration by Advantage Weighted Regression

15 years 7 months ago

Download www.kyb.mpg.de

Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...

Gerhard Neumann, Jan Peters

claim paper

Read More »

177

click to vote

IJRR
2008

139views more IJRR 2008»

Learning to Control in Operational Space

15 years 6 months ago

Download www.kyb.tuebingen.mpg.de

One of the most general frameworks for phrasing control problems for complex, redundant robots is operational space control. However, while this framework is of essential importan...

Jan Peters, Stefan Schaal

claim paper

Read More »

179

click to vote

AIIA
2007
Springer

147views Artificial Intelligence» more AIIA 2007»

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

16 years 11 days ago

Download sequel.futurs.inria.fr

The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...

Andrea Bonarini, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

163

click to vote

WWW
2007
ACM

106views Internet Technology» more WWW 2007»

Exposing private information by timing web applications

16 years 6 months ago

Download crypto.stanford.edu

We show that the time web sites take to respond to HTTP requests can leak private information, using two different types of attacks. The first, direct timing, directly measures re...

Andrew Bortz, Dan Boneh

claim paper

Read More »

163

click to vote

IROS
2008
IEEE

118views Robotics» more IROS 2008»

Laban Movement Analysis for multi-ocular systems

16 years 17 days ago

Download mail.isr.uc.pt

Abstract— We present as a contribution to the ﬁeld of humanmachine interaction a system that analyzes human movements online through multiple observers, based on the concept of...

Jörg Rett, Luis Santos, Jorge Dias

claim paper

Read More »

« Prev « First page 7 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers