Sciweavers

1233 search results - page 211 / 247
» Reinforcement learning
Sort
View
138
Voted

Publication
222views
15 years 11 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis
132
Voted
IROS
2007
IEEE
168views Robotics» more  IROS 2007»
15 years 9 months ago
Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression
Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...
Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...
120
Voted
ATAL
2005
Springer
15 years 8 months ago
Modeling task allocation using a decision theoretic model
Mediation is the process of decomposing a task into subtasks, finding agents suitable for these subtasks and negotiating with agents to obtain commitments to execute these subtas...
Sherief Abdallah, Victor R. Lesser
ISCC
2003
IEEE
110views Communications» more  ISCC 2003»
15 years 8 months ago
Intelligent Agents Serving Based On The Society Information
In this paper, we propose a serving system consisting intelligent agents processing society information in a multi-user domain. The agents use the similarity information on the us...
Sanem Sariel, B. Tevfik Akgün
126
Voted
ECAL
2007
Springer
15 years 6 months ago
Genotype Reuse More Important than Genotype Size in Evolvability of Embodied Neural Networks
odel of Embodiment on Abstract Systems: from Hierarchy to Heterarchy Kohei Nakajima, Soya Shinkai, Takashi Ikegami A Behavior-Based Model of the Hydra, Phylum Cnidaria Malin Aktius...
Chad W. Seys, Randall D. Beer