Search Sciweavers | Sciweavers

340 search results - page 17 / 68

» Kernelized value function approximation for reinforcement le...

132

Voted

AIIDE
2006

123views Artificial Intelligence» more AIIDE 2006»

The Self Organization of Context for Learning in MultiAgent Games

15 years 3 months ago

Download www.aaai.org

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, ti...

Christopher D. White, Dave Brogan

claim paper

Read More »

120

Voted

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 3 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

133

click to vote

CIG
2005
IEEE

162views Applied Computing» more CIG 2005»

Nannon: A Nano Backgammon for Machine Learning Research

15 years 8 months ago

Download cswww.essex.ac.uk

A newly designed game is introduced, which feels like Backgammon, but has a simplified rule set. Unlike earlier attempts at simplifying the game, Nannon maintains enough features a...

Jordan B. Pollack

claim paper

Read More »

125

Voted

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 4 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

126

Voted

AAAI
2010

178views Intelligent Agents» more AAAI 2010»

Stability and Incentive Compatibility in a Kernel-Based Combinatorial Auction

15 years 3 months ago

Download www.research.yahoo.com

We present the design and analysis of an approximately incentive-compatible combinatorial auction. In just a single run, the auction is able to extract enough value information fr...

Sébastien Lahaie

claim paper

Read More »

« Prev « First page 17 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers