Search Sciweavers | Sciweavers

58 search results - page 9 / 12

» A Dynamic Allocation Method of Basis Functions in Reinforcem...

124

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 4 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

149

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 3 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

137

click to vote

CIG
2005
IEEE

162views Applied Computing» more CIG 2005»

Nannon: A Nano Backgammon for Machine Learning Research

15 years 8 months ago

Download cswww.essex.ac.uk

A newly designed game is introduced, which feels like Backgammon, but has a simplified rule set. Unlike earlier attempts at simplifying the game, Nannon maintains enough features a...

Jordan B. Pollack

claim paper

Read More »

125

click to vote

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 2 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

130

click to vote

KDD
1998
ACM

112views Data Mining» more KDD 1998»

Evaluating Usefulness for Dynamic Classification

15 years 7 months ago

Download www.aaai.org

This paper develops the concept of usefulness in the context of supervised learning. We argue that usefulness can be used to improve the performance of classification rules (as me...

Gholamreza Nakhaeizadeh, Charles Taylor, Carsten L...

claim paper

Read More »

« Prev « First page 9 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers