Search Sciweavers | Sciweavers

226 search results - page 26 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

205

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

15 years 3 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

158

click to vote

ICML
2005
IEEE

201views Machine Learning» more ICML 2005»

Interactive learning of mappings from visual percepts to actions

16 years 6 months ago

Download www.machinelearning.org

We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier i...

Justus H. Piater, Sébastien Jodogne

claim paper

Read More »

163

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 5 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

149

click to vote

IROS
2007
IEEE

144views Robotics» more IROS 2007»

Using reinforcement learning to adapt an imitation task

16 years 8 days ago

Download lasa.epfl.ch

Abstract— The goal of developing algorithms for programming robots by demonstration is to create an easy way of programming robots that can be accomplished by everyone. When a de...

Florent Guenter, Aude Billard

claim paper

Read More »

176

click to vote

ICAC
2008
IEEE

99views Applied Computing» more ICAC 2008»

Utility-Based Reinforcement Learning for Reactive Grids

16 years 13 days ago

Download hal.inria.fr

—Large scale production grids are an important case for autonomic computing. They follow a mutualization paradigm: decision-making (human or automatic) is distributed and largely...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

« Prev « First page 26 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers