Search Sciweavers | Sciweavers

688 search results - page 52 / 138

» Using reinforcement learning to adapt an imitation task

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

13 years 5 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

click to vote

ECAI
2000
Springer

131views Artificial Intelligence» more ECAI 2000»

Autonomous Environment and Task Adaptation for Robotic Agents

14 years 6 days ago

Download ias.in.tum.de

This paper investigates the problem of improving the performance of general state-of-the-art robot control systems by autonomously adapting them to speciﬁc tasks and environments...

Michael Beetz, Thorsten Belker

claim paper

Read More »

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

14 years 8 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

click to vote

HIS
2004

168views Information Technology» more HIS 2004»

A Case-Based Recommender for Task Assignment in Heterogeneous Computing Systems

13 years 9 months ago

Download ceit.aut.ac.ir

Case-based reasoning (CBR) is a knowledge-based problem-solving technique, which is based on reuse of previous experiences. In this paper we propose a new model for static task as...

S. Ghanbari, Mohammad Reza Meybodi, Kambiz Badie

claim paper

Read More »

click to vote

CEC
2009
IEEE

102views Artificial Intelligence» more CEC 2009»

Lamarckian neuroevolution for visual control in the Quake II environment

14 years 2 months ago

Download nebl.cse.unr.edu

Abstract— A combination of backpropagation and neuroevolution is used to train a neural network visual controller for agents in the Quake II environment. The agents must learn to...

Matt Parker, Bobby D. Bryant

claim paper

Read More »

« Prev « First page 52 / 138 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers