Sciweavers

1166 search results - page 162 / 234
» Negotiating Using Rewards
Sort
View
ICFP
2006
ACM
14 years 10 months ago
Programming monads operationally with Unimo
Monads are widely used in Haskell for modeling computational effects, but defining monads remains a daunting challenge. Since every part of a monad's definition depends on it...
Chuan-Kai Lin
ICRA
2009
IEEE
132views Robotics» more  ICRA 2009»
14 years 4 months ago
Smoothed Sarsa: Reinforcement learning for robot delivery tasks
— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actio...
Deepak Ramachandran, Rakesh Gupta
IROS
2008
IEEE
125views Robotics» more  IROS 2008»
14 years 4 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
VTC
2008
IEEE
185views Communications» more  VTC 2008»
14 years 4 months ago
Opportunistic Spectrum Access for Energy-Constrained Cognitive Radios
This paper considers a scenario in which a secondary user makes opportunistic use of a channel allocated to some primary network. The primary network operates in a time-slotted ma...
Anh Tuan Hoang, Ying-Chang Liang, David Tung Chong...
AIMSA
2008
Springer
14 years 4 months ago
A Social and Emotional Model for Obtaining Believable Emergent Behaviors
Abstract This paper attempts to define an emotional model for virtual agents that behave autonomously in social worlds. We adopt shallow modeling based on the decomposition of the...
Javier Asensio, Marta Jiménez, Susana Fern&...