Sciweavers

1166 search results - page 24 / 234
» Negotiating Using Rewards
Sort
View
ML
2002
ACM
168views Machine Learning» more  ML 2002»
13 years 8 months ago
On Average Versus Discounted Reward Temporal-Difference Learning
We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...
John N. Tsitsiklis, Benjamin Van Roy
ATAL
2004
Springer
14 years 2 months ago
A Bayes Net Approach to Argumentation
Argumentation-based negotiation approaches have been proposed to present realistic negotiation contexts. This paper presents a novel Bayesian network based argumentation and decis...
Sabyasachi Saha, Sandip Sen
NN
2007
Springer
105views Neural Networks» more  NN 2007»
13 years 8 months ago
Guiding exploration by pre-existing knowledge without modifying reward
Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...
Kary Främling
NIPS
2003
13 years 10 months ago
All learning is Local: Multi-agent Learning in Global Reward Games
In large multiagent games, partial observability, coordination, and credit assignment persistently plague attempts to design good learning algorithms. We provide a simple and efï¬...
Yu-Han Chang, Tracey Ho, Leslie Pack Kaelbling
NETWORKING
2000
13 years 10 months ago
QoS Rewards and Risks: A Multi-market Approach to Resource Allocation
A large number of network applications require a particular Quality of Service (QoS), that can be provided through proper network resource allocation. Furthermore, certain applicat...
Errin W. Fulp, Douglas S. Reeves