Search Sciweavers | Sciweavers

155 search results - page 29 / 31

» Multi-agent Reinforcement Learning Using Strategies and Voti...

ICML
2009
IEEE

155 views, Machine Learning

Near-Bayesian exploration in polynomial time

16 years 7 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

ROBOCUP
2004
Springer

147 views, Robotics

Learning to Drive and Simulate Autonomous Mobile Robots

16 years 1 day ago

Download page.mi.fu-berlin.de

We show how to apply learning methods to two robotics problems, namely the optimization of the on-board controller of an omnidirectional robot, and the derivation of a model of the...

Alexander Gloye, Cüneyt Göktekin, Anna E...

GECCO
2006
Springer

142 views, Optimization

Classifier prediction based on tile coding

15 years 10 months ago

Download www.eskimo.com

This paper introduces XCSF extended with tile coding prediction: each classifier implements a tile coding approximator; the genetic algorithm is used to adapt both classifier cond...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

ICANN
2010
Springer

164 views, Neural Networks

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 6 months ago

Download www.idsia.ch

Developing superior artificial board-game players is a widely studied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J...

IROS
2007
IEEE

172 views, Robotics

Motor control optimization of compliant one-legged locomotion in rough terrain

16 years 1 month ago

Download groups.csail.mit.edu

While underactuated robotic systems are capable of energy-efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...

Fumiya Iida, Russ Tedrake
