Search Sciweavers | Sciweavers

165 search results - page 22 / 33

» Exploration and apprenticeship learning in reinforcement lea...

IJCAI
2001

84 views, Artificial Intelligence

Reinforcement Learning in Distributed Domains: Beyond Team Games

13 years 9 months ago

Download web.engr.oregonstate.edu

Using a distributed algorithm rather than a centralized one can be extremely beneficial in large search problems. In addition, the incorporation of machine learning techniques lik...

David Wolpert, Joseph Sill, Kagan Tumer

AAAI
2007

122 views, Intelligent Agents

RETALIATE: Learning Winning Policies in First-Person Shooter Games

13 years 10 months ago

Download www.cse.lehigh.edu

In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team first-person shooter games. RETALIATE has three crucial chara...

Megan Smith, Stephen Lee-Urban, Hector Muño...

ICRA
2009
IEEE

227 views, Robotics

Adaptive autonomous control using online value iteration with Gaussian processes

14 years 2 months ago

Download www-personal.acfr.usyd.edu.au

In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

Publication

233 views

Sparse reward processes

12 years 6 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

ICMLA
2010

203 views, Machine Learning

Multimodal Parameter-exploring Policy Gradients

13 years 5 months ago

Download www6.in.tum.de

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...
