Search Sciweavers | Sciweavers

827 search results - page 75 / 166

» Variational methods for Reinforcement Learning

217

click to vote

ROBOCUP
2000
Springer

130views Robotics» more ROBOCUP 2000»

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

15 years 10 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada

claim paper

Read More »

202

click to vote

ANOR
2005

80views more ANOR 2005»

Entropic Penalties in Finite Games

15 years 7 months ago

Download www.science.unitn.it

The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...

Sjur Didrik Flåm, E. Cavazzuti

claim paper

Read More »

190

click to vote

GECCO
2008
Springer

128views Optimization» more GECCO 2008»

Adapted Pittsburgh classifier system: building accurate strategies in non markovian environments

15 years 8 months ago

Download www.cs.bham.ac.uk

This paper focuses on the study of the behavior of a genetic algorithm based classiﬁer system, the Adapted Pittsburgh Classiﬁer System (A.P.C.S), on maze type environments con...

Gilles Énée, Mathias Péroumal...

claim paper

Read More »

200

click to vote

ICML
2004
IEEE

163views Machine Learning» more ICML 2004»

Multi-task feature and kernel selection for SVMs

16 years 8 months ago

Download www1.cs.columbia.edu

We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...

Tony Jebara

claim paper

Read More »

170

Voted

ESANN
2007

122views Neural Networks» more ESANN 2007»

The Recurrent Control Neural Network

15 years 8 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-eﬃcient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...

claim paper

Read More »

« Prev « First page 75 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers