

AAAI
2008

157views Intelligent Agents» more AAAI 2008»

Bayes-Relational Learning of Opponent Models from Incomplete Information in No-Limit Poker

15 years 4 months ago

We propose an opponent modeling approach for no-limit Texas hold'em poker that starts from a (learned) prior, i.e., general expectations about opponent behavior, and learns a relat...

Marc J. V. Ponsen, Jan Ramon, Tom Croonenborghs, K...


AAMAS
2007
Springer

164views Intelligent Agents» more AAMAS 2007»

Networks of Learning Automata and Limiting Games

15 years 8 months ago

Download como.vub.ac.be

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is that...

Peter Vrancx, Katja Verbeeck, Ann Nowé


ICANN
2009
Springer

123views Neural Networks» more ICANN 2009»

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

15 years 6 months ago

Download www.tu-ilmenau.de

In a typical reinforcement learning (RL) setting, details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft


ICML
2010
IEEE

223views Machine Learning» more ICML 2010»

Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes

15 years 3 months ago

Download anytime.cs.umass.edu

Approximate dynamic programming has been used successfully in a large variety of domains, but it relies on a small set of provided approximation features to calculate solutions re...

Marek Petrik, Gavin Taylor, Ronald Parr, Shlomo Zi...


ECML
2003
Springer

78views Machine Learning» more ECML 2003»

Learning Context Free Grammars in the Limit Aided by the Sample Distribution

15 years 7 months ago

Download www.ozsl.uu.nl

We present an algorithm for learning context free grammars from positive structural examples (unlabeled parse trees). The algorithm receives a parameter in the form of a finite se...

Yoav Seginer
