Sciweavers

1684 search results - page 134 / 337
» The lexicographic decision function
Sort
View
ICML
1994
IEEE
15 years 7 months ago
Markov Games as a Framework for Multi-Agent Reinforcement Learning
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....
Michael L. Littman
HCI
2007
15 years 5 months ago
Models of Command and Control
This paper reports on five different models of command and control. Four different models are reviewed: a process model, a contextual control model, a decision ladder model and a ...
Neville A. Stanton, Guy H. Walker, Daniel P. Jenki...
IJCAI
2007
15 years 5 months ago
Vote and Aggregation in Combinatorial Domains with Structured Preferences
In many real-world collective decision problems, the set of alternatives is a Cartesian product of finite value domains for each of a given set of variables. The prohibitive size...
Jérôme Lang
NIPS
2008
15 years 5 months ago
Adapting to a Market Shock: Optimal Sequential Market-Making
We study the profit-maximization problem of a monopolistic market-maker who sets two-sided prices in an asset market. The sequential decision problem is hard to solve because the ...
Sanmay Das, Malik Magdon-Ismail
NIPS
2008
15 years 5 months ago
Bayesian Model of Behaviour in Economic Games
Classical game theoretic approaches that make strong rationality assumptions have difficulty modeling human behaviour in economic games. We investigate the role of finite levels o...
Debajyoti Ray, Brooks King-Casas, P. Read Montague...