Sciweavers

1166 search results - page 96 / 234
» Negotiating Using Rewards
GECCO
2006
Springer
15 years 5 months ago
On-line evolutionary computation for reinforcement learning in stochastic domains
In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...
Shimon Whiteson, Peter Stone
ACL
2008
15 years 3 months ago
Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation
We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...
Verena Rieser, Oliver Lemon
WSC
1998
15 years 3 months ago
A Study of Self-adjusting Quality of Service Control Schemes
This paper reports simulation methods and results for analyzing a self-adjusting Quality of Service (QoS) control scheme for multimedia/telecommunication systems based on resource...
Sheng-Tzong Cheng, Chi-Ming Chen, Ing-Ray Chen
ML
1998
ACM
15 years 1 months ago
Learning Team Strategies: Soccer Case Studies
We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy, but may behave differently due to position-dependent inputs. All...
Rafal Salustowicz, Marco Wiering, Jürgen Schm...
ICRA
2010
IEEE
15 years 20 days ago
Adaptive multi-robot coordination: A game-theoretic perspective
Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...
Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus