Search Sciweavers | Sciweavers

252 search results - page 33 / 51

» Learning Partially Observable Action Models: Efficient Algor...

186

click to vote

ATAL
2011
Springer

234views Intelligent Agents» more ATAL 2011»

Game theory-based opponent modeling in large imperfect-information games

14 years 6 months ago

Download www.cs.cmu.edu

We develop an algorithm for opponent modeling in large extensive-form games of imperfect information. It works by observing the opponent’s action frequencies and building an opp...

Sam Ganzfried, Tuomas Sandholm

claim paper

Read More »

256

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 2 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

160

click to vote

CORR
2010
Springer

106views Education» more CORR 2010»

MDPs with Unawareness

15 years 6 months ago

Download www.cs.cornell.edu

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...

Joseph Y. Halpern, Nan Rong, Ashutosh Saxena

claim paper

Read More »

171

click to vote

JSAC
2010

138views more JSAC 2010»

Dynamic conjectures in random access networks using bio-inspired learning

15 years 4 months ago

Download medianetlab.ee.ucla.edu

—Inspired by the biological entities’ ability to achieve reciprocity in the course of evolution, this paper considers a conjecture-based distributed learning approach that enab...

Yi Su, Mihaela van der Schaar

claim paper

Read More »

179

click to vote

LAMAS
2005
Springer

110views Intelligent Agents» more LAMAS 2005»

The Success and Failure of Tag-Mediated Evolution of Cooperation

15 years 12 months ago

Download www.cs.cmu.edu

Use of tags to limit partner selection for playing has been shown to produce stable cooperation in agent populations playing the Prisoner’s Dilemma game. There is, however, a lac...

Austin McDonald, Sandip Sen

claim paper

Read More »

« Prev « First page 33 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers