Search Sciweavers | Sciweavers

100 search results - page 13 / 20

» Basis Function Construction in Reinforcement Learning Using ...

117

Voted

NIPS
2004

92views Information Technology» more NIPS 2004»

Responding to Modalities with Different Latencies

15 years 4 months ago

Download books.nips.cc

Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...

Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...

claim paper

Read More »

134

click to vote

ML
1998
ACM

131views Machine Learning» more ML 1998»

Learning from Examples and Membership Queries with Structured Determinations

15 years 2 months ago

Download web.engr.oregonstate.edu

It is well known that prior knowledge or bias can speed up learning, at least in theory. It has proved di cult to make constructive use of prior knowledge, so that approximately c...

Prasad Tadepalli, Stuart J. Russell

claim paper

Read More »

122

Voted

NIPS
1993

123views Information Technology» more NIPS 1993»

Temporal Difference Learning of Position Evaluation in the Game of Go

15 years 4 months ago

Download www.gatsby.ucl.ac.uk

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...

Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...

claim paper

Read More »

147

Voted

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 2 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

135

Voted

NPL
1998

175views more NPL 1998»

Prediction of Chaotic Time-Series with a Resource-Allocating RBF Network

15 years 2 months ago

Download www.meduniwien.ac.at

Abstract. One of the main problems associated with arti cial neural networks online learning methods is the estimation of model order. In this paper, we report about a new approach...

Roman Rosipal, Milos Koska, Igor Farkas

claim paper

Read More »

« Prev « First page 13 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers