Sciweavers

1799 search results - page 228 / 360
» Filtered Reinforcement Learning
ICALT
2006
IEEE
15 years 8 months ago
Citizenship Education Using Human- and Agent-Based Participatory Gaming Simulation
In this paper, we describe new methodologies for reinforcing the social consensus building of the multicultural coexistence assistance using participatory simulation in civil soci...
Reiko Hishiyama, Toru Ishida
IROS
2006
IEEE
15 years 8 months ago
A Hybrid Control Architecture for Autonomous Robotic Fish
This paper presents a hybrid control architecture for autonomous robotic fishes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...
Jindong Liu, Huosheng Hu, Dongbing Gu
CEEMAS
2005
Springer
15 years 8 months ago
A Direct Reputation Model for VO Formation
We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...
Arturo Avila-Rosas, Michael Luck
ICRA
1994
IEEE
15 years 6 months ago
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions, which have recently been proposed for path planning, and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly
ROBOCUP
2000
Springer
15 years 6 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada