Sciweavers

1166 search results - page 96 / 234
» Negotiating Using Rewards
Sort
View
GECCO
2006
Springer
133views Optimization» more  GECCO 2006»
14 years 1 months ago
On-line evolutionary computation for reinforcement learning in stochastic domains
In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...
Shimon Whiteson, Peter Stone
ACL
2008
13 years 11 months ago
Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation
We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...
Verena Rieser, Oliver Lemon
WSC
1998
13 years 11 months ago
A Study of Self-adjusting Quality of Service Control Schemes
This paper reports simulation methods and results for analyzing a self-adjusting Quality of Service (QoS) control scheme for multimedia/telecommunication systems based on resource...
Sheng-Tzong Cheng, Chi-Ming Chen, Ing-Ray Chen
ML
1998
ACM
117views Machine Learning» more  ML 1998»
13 years 9 months ago
Learning Team Strategies: Soccer Case Studies
We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy, but may behave di erently due to position-dependent inputs. All...
Rafal Salustowicz, Marco Wiering, Jürgen Schm...
ICRA
2010
IEEE
162views Robotics» more  ICRA 2010»
13 years 8 months ago
Adaptive multi-robot coordination: A game-theoretic perspective
Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...
Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus