Search Sciweavers | Sciweavers

226 search results - page 40 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

NIPS
2007

162views Information Technology» more NIPS 2007»

Bundle Methods for Machine Learning

13 years 8 months ago

Download books.nips.cc

We present a globally convergent method for regularized risk minimization problems. Our method applies to Support Vector estimation, regression, Gaussian Processes, and any other ...

Alex J. Smola, S. V. N. Vishwanathan, Quoc V. Le

claim paper

Read More »

click to vote

ATAL
2008
Springer

124views Intelligent Agents» more ATAL 2008»

Social reward shaping in the prisoner's dilemma

13 years 9 months ago

Download www.aamas-conference.org

Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward sha...

Monica Babes, Enrique Munoz de Cote, Michael L. Li...

claim paper

Read More »

click to vote

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

13 years 9 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

click to vote

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

14 years 8 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

click to vote

AAAI
2008

141views Intelligent Agents» more AAAI 2008»

Economic Hierarchical Q-Learning

13 years 9 months ago

Download www.aaai.org

Hierarchical state decompositions address the curse-ofdimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, w...

Erik G. Schultink, Ruggiero Cavallo, David C. Park...

claim paper

Read More »

« Prev « First page 40 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers