Search Sciweavers | Sciweavers

166 search results - page 23 / 34

» Safe exploration for reinforcement learning

155

click to vote

ICANN
2010
Springer

164views Neural Networks» more ICANN 2010»

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 6 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...

claim paper

Read More »

144

click to vote

AIIDE
2008

146views Artificial Intelligence» more AIIDE 2008»

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

15 years 8 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

claim paper

Read More »

124

click to vote

ATAL
2007
Springer

108views Intelligent Agents» more ATAL 2007»

Dynamic task allocation within an open service-oriented MAS architecture

16 years 10 days ago

Download www.isys.ucl.ac.be

A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...

Ivan Jureta, Stéphane Faulkner, Youssef Ach...

claim paper

Read More »

176

click to vote

ICRA
2009
IEEE

227views Robotics» more ICRA 2009»

Adaptive autonomous control using online value iteration with gaussian processes

16 years 25 days ago

Download www-personal.acfr.usyd.edu.au

— In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

claim paper

Read More »

286

click to vote

Publication

233views

Sparse reward processes

14 years 4 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 23 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers