Sciweavers

1166 search results - page 21 / 234
» Negotiating Using Rewards
Sort
View
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
14 years 3 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
ICML
2006
IEEE
14 years 9 months ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto
IJCAI
2001
13 years 10 months ago
Complexity of Probabilistic Planning under Average Rewards
A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...
Jussi Rintanen
SAC
2006
ACM
14 years 2 months ago
Implementing rule-based mechanisms for agent-based price negotiations
This note describes a sample implementation of automated negotiations in an e-commerce modeling multi-agent system. A specific set of rules is used for enforcing negotiation mech...
Costin Badica, Adriana Badita, Maria Ganzha
ALT
2007
Springer
14 years 5 months ago
Pseudometrics for State Aggregation in Average Reward Markov Decision Processes
We consider how state similarity in average reward Markov decision processes (MDPs) may be described by pseudometrics. Introducing the notion of adequate pseudometrics which are we...
Ronald Ortner