Search Sciweavers | Sciweavers

1799 search results - page 228 / 360

» Filtered Reinforcement Learning

ICALT
2006
IEEE

143 views · Machine Learning

Citizenship Education Using Human- and Agent-Based Participatory Gaming Simulation

15 years 8 months ago

Download www.ai.soc.i.kyoto-u.ac.jp

In this paper, we describe new methodologies for reinforcing the social consensus building of the multicultural coexistence assistance using participatory simulation in civil soci...

Reiko Hishiyama, Toru Ishida

IROS
2006
IEEE

147 views · Robotics

A Hybrid Control Architecture for Autonomous Robotic Fish

15 years 8 months ago

Download cswww.essex.ac.uk

This paper presents a hybrid control architecture for autonomous robotic fish that are able to swim and navigate in unknown or dynamically changing environments. It has a t...

Jindong Liu, Huosheng Hu, Dongbing Gu

CEEMAS
2005
Springer

87 views · Intelligent Agents

A Direct Reputation Model for VO Formation

15 years 8 months ago

Download www.dcs.kcl.ac.uk

We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...

Arturo Avila-Rosas, Michael Luck

ICRA
1994
IEEE

105 views · Robotics

Harmonic Functions and Collision Probabilities

15 years 6 months ago

Download www.cs.cmu.edu

There is a close relationship between harmonic functions, which have recently been proposed for path planning, and hitting probabilities for random processes. The hitting probab...

Christopher I. Connolly

ROBOCUP
2000
Springer

130 views · Robotics

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

15 years 6 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, one of the most widely used reinforcement learning methods, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to apply to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada
