Search Sciweavers | Sciweavers

567 search results - page 59 / 114

» Regularized Policy Iteration

184

click to vote

IROS
2007
IEEE

144views Robotics» more IROS 2007»

Bipedal walking on rough terrain using manifold control

16 years 1 months ago

Download www.cse.wustl.edu

— This paper presents an algorithm for adapting periodic behavior to gradual shifts in task parameters. Since learning optimal control in high dimensional domains is subject to t...

Tom Erez, William D. Smart

claim paper

Read More »

220

click to vote

SOUPS
2005
ACM

116views Security Privacy» more SOUPS 2005»

Usable security and privacy: a case study of developing privacy management tools

16 years 19 days ago

Download cups.cs.cmu.edu

Privacy is a concept which received relatively little attention during the rapid growth and spread of information technology through the 1980’s and 1990’s. Design to make info...

Carolyn Brodie, Clare-Marie Karat, John Karat, Jin...

claim paper

Read More »

191

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

214

click to vote

JCDL
2005
ACM

161views Education» more JCDL 2005»

Downloading textual hidden web content through keyword queries

16 years 19 days ago

Download oak.cs.ucla.edu

An ever-increasing amount of information on the Web today is available only through search interfaces: the users have to type in a set of keywords in a search form in order to acc...

Alexandros Ntoulas, Petros Zerfos, Junghoo Cho

claim paper

Read More »

203

click to vote

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

15 years 9 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

« Prev « First page 59 / 114 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers