Search Sciweavers | Sciweavers

4544 search results - page 215 / 909

» Reinforcement Learning with Time

109

click to vote

CEEMAS
2005
Springer

87views Intelligent Agents» more CEEMAS 2005»

A Direct Reputation Model for VO Formation

15 years 8 months ago

Download www.dcs.kcl.ac.uk

We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...

Arturo Avila-Rosas, Michael Luck

claim paper

Read More »

105

click to vote

ICRA
1994
IEEE

105views Robotics» more ICRA 1994»

Harmonic Functions and Collision Probabilities

15 years 7 months ago

Download www.cs.cmu.edu

There is a close relationship between harmonic functions { which have recently been proposed for path planning { and hitting probabilities for random processes. The hitting probab...

Christopher I. Connolly

claim paper

Read More »

150

click to vote

ROBOCUP
2000
Springer

130views Robotics» more ROBOCUP 2000»

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

15 years 6 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada

claim paper

Read More »

114

Voted

ESANN
2008

125views Neural Networks» more ESANN 2008»

Improvement in Game Agent Control Using State-Action Value Scaling

15 years 4 months ago

Download www.dice.ucl.ac.be

The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...

Leo Galway, Darryl Charles, Michaela M. Black

claim paper

Read More »

117

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

15 years 4 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

« Prev « First page 215 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers