Sciweavers

1166 search results - page 82 / 234
» Negotiating Using Rewards
Sort
View
EPIA
2001
Springer
14 years 2 months ago
Dynamic Evaluation of Coordination Mechanisms for Autonomous Agents
Abstract. This paper presents a formal framework within which autonomous agents can dynamically select and apply different mechanisms to coordinate their interactions with one ano...
Rachel A. Bourne, Karen Shoop, Nicholas R. Jenning...
UAI
2008
13 years 11 months ago
Improving Gradient Estimation by Incorporating Sensor Data
An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...
Gregory Lawrence, Stuart J. Russell
CORR
2010
Springer
94views Education» more  CORR 2010»
13 years 10 months ago
An Enhanced Search Technique for Managing Partial Coverage and Free Riding in P2P Networks
This paper presents a Q-learning based scheme for managing the partial coverage problem and the ill effects of free riding in unstructured P2P networks. Based on various parameter ...
Sabu M. Thampi, K. Chandra Sekaran
CDC
2010
IEEE
114views Control Systems» more  CDC 2010»
13 years 5 months ago
Modeling and evaluation of decision-making dynamics in sequential two-alternative forced choice tasks
The focus of the work in this paper is the evaluation of a model of human decision making relative to experimental data. In sequential two-alternative forced choice decision tasks,...
Caleb Woodruff, Kristi A. Morgansen, Linh Vu, Damo...
EOR
2007
123views more  EOR 2007»
13 years 10 months ago
Dynamic programming analysis of the TV game "Who wants to be a millionaire?"
This paper uses dynamic programming to investigate when contestants should use lifelines or when they should just stop answering in the TV quiz show ‘Who wants to be a millionai...
Federico Perea, Justo Puerto