Search Sciweavers | Sciweavers

1236 search results - page 183 / 248

» Opposition-Based Reinforcement Learning

160

click to vote

ISNN
2007
Springer

116views Neural Networks» more ISNN 2007»

Online Dynamic Value System for Machine Learning

16 years 13 days ago

Download www.ent.ohiou.edu

A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...

Haibo He, Janusz A. Starzyk

claim paper

Read More »

126

click to vote

ATAL
2004
Springer

221views Intelligent Agents» more ATAL 2004»

When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents

15 years 11 months ago

Download leibniz.cs.huji.ac.il

This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, speciﬁcally, geneticlearning-parentin...

Michael Berger, Jeffrey S. Rosenschein

claim paper

Read More »

169

click to vote

ESANN
2007

148views Neural Networks» more ESANN 2007»

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

15 years 7 months ago

Download www.dice.ucl.ac.be

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

claim paper

Read More »

158

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

16 years 7 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

163

click to vote

FBIT
2007
IEEE

142views Information Technology» more FBIT 2007»

Learning to Drive a Real Car in 20 Minutes

16 years 19 days ago

Download www.ni.uos.de

The paper describes our ﬁrst experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...

Martin Riedmiller, Michael Montemerlo, Hendrik Dah...

claim paper

Read More »

« Prev « First page 183 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers