Sciweavers

1166 search results - page 100 / 234
» Negotiating Using Rewards
Sort
View
IAT
2009
IEEE
14 years 2 months ago
Offline Planning for Communication by Exploiting Structured Interactions in Decentralized MDPs
Variants of the decentralized MDP model focus on problems exhibiting some special structure that makes them easier to solve in practice. Our work is concerned with two main issues...
Hala Mostafa, Victor R. Lesser
CEC
2008
IEEE
14 years 4 days ago
Learning benefits evolution if sex gives pleasure
Abstract-- In this paper we investigate the effects of individual learning on an evolving population of situated agents. We work with a novel type of system where agents can decide...
Robert Griffioen, Selmar K. Smit, A. E. Eiben
AAAI
2006
13 years 11 months ago
Functional Value Iteration for Decision-Theoretic Planning with General Utility Functions
We study how to find plans that maximize the expected total utility for a given MDP, a planning objective that is important for decision making in high-stakes domains. The optimal...
Yaxin Liu, Sven Koenig
AAMAS
2010
Springer
13 years 10 months ago
Teaching a pet-robot to understand user feedback through interactive virtual training tasks
Abstract In this paper, we present a human-robot teaching framework that uses "virtual" games as a means for adapting a robot to its user through natural interaction in a...
Anja Austermann, Seiji Yamada
CORR
2010
Springer
127views Education» more  CORR 2010»
13 years 10 months ago
Mean field for Markov Decision Processes: from Discrete to Continuous Optimization
We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal...
Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec