Sciweavers

252 search results - page 33 / 51
» Learning Partially Observable Action Models: Efficient Algor...
ATAL
2011
Springer
14 years 2 months ago
Game theory-based opponent modeling in large imperfect-information games
We develop an algorithm for opponent modeling in large extensive-form games of imperfect information. It works by observing the opponent’s action frequencies and building an opp...
Sam Ganzfried, Tuomas Sandholm
CSL
2012
Springer
13 years 10 months ago
Reinforcement learning for parameter estimation in statistical spoken dialogue systems
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...
Filip Jurčíček, Blaise Thomson, Steve Young
CORR
2010
Springer
15 years 2 months ago
MDPs with Unawareness
Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...
Joseph Y. Halpern, Nan Rong, Ashutosh Saxena
JSAC
2010
15 years 1 months ago
Dynamic conjectures in random access networks using bio-inspired learning
Inspired by the biological entities’ ability to achieve reciprocity in the course of evolution, this paper considers a conjecture-based distributed learning approach that enab...
Yi Su, Mihaela van der Schaar
LAMAS
2005
Springer
15 years 8 months ago
The Success and Failure of Tag-Mediated Evolution of Cooperation
Use of tags to limit partner selection for playing has been shown to produce stable cooperation in agent populations playing the Prisoner’s Dilemma game. There is, however, a lac...
Austin McDonald, Sandip Sen