Sciweavers

» Multi-criteria Reinforcement Learning
IROS
2006
IEEE
A Hybrid Control Architecture for Autonomous Robotic Fish
This paper presents a hybrid control architecture for autonomous robotic fishes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...
Jindong Liu, Huosheng Hu, Dongbing Gu
CEEMAS
2005
Springer
A Direct Reputation Model for VO Formation
We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...
Arturo Avila-Rosas, Michael Luck
ICRA
1994
IEEE
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions (which have recently been proposed for path planning) and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly
ROBOCUP
2000
Springer
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ESANN
2008
Improvement in Game Agent Control Using State-Action Value Scaling
The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...
Leo Galway, Darryl Charles, Michaela M. Black