Search Sciweavers | Sciweavers

200 search results - page 25 / 40

» Point-Based Policy Iteration

DATE
2008
IEEE

A Framework of Stochastic Power Management Using Hidden Markov Model

The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...

Ying Tan, Qinru Qiu

IROS
2007
IEEE

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

IROS
2007
IEEE

Bipedal walking on rough terrain using manifold control

This paper presents an algorithm for adapting periodic behavior to gradual shifts in task parameters. Since learning optimal control in high dimensional domains is subject to t...

Tom Erez, William D. Smart

SOUPS
2005
ACM

Usable security and privacy: a case study of developing privacy management tools

Privacy is a concept which received relatively little attention during the rapid growth and spread of information technology through the 1980s and 1990s. Design to make info...

Carolyn Brodie, Clare-Marie Karat, John Karat, Jin...

NIPS
2007

Incremental Natural Actor-Critic Algorithms

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
