Search Sciweavers | Sciweavers

364 search results - page 44 / 73

» Regularization Learning of Neural Networks for Generalizatio...

NN
2010
Springer

125 views · Neural Networks

Parameter-exploring policy gradients

15 years 5 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

GECCO
2007
Springer

195 views · Optimization

MILCS: a mutual information learning classifier system

16 years 1 month ago

Download www.cs.bham.ac.uk

This paper introduces a new variety of learning classifier system (LCS), called MILCS, which utilizes mutual information as fitness feedback. Unlike most LCSs, MILCS is specifical...

Robert Elliott Smith, Max Kun Jiang

CIKM
2011
Springer

201 views · Information Technology

Content based social behavior prediction: a multi-task learning approach

14 years 7 months ago

Download people.eecs.ku.edu

The study of information flow analyzes the principles and mechanisms of social information distribution. It is becoming an extremely important research topic in social network re...

Hongliang Fei, Ruoyi Jiang, Yuhao Yang, Bo Luo, Ju...

CIKM
2000
Springer

104 views · Information Technology

Relevance and Reinforcement in Interactive Browsing

15 years 11 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski

IJCNN
2006
IEEE

109 views · Neural Networks

On derivation of stagewise second-order backpropagation by invariant imbedding for multi-stage neural-network learning

16 years 1 month ago

Download www.ieor.berkeley.edu

We present a simple, intuitive argument based on “invariant imbedding” in the spirit of dynamic programming to derive a stagewise second-order backpropagation (BP) algorith...

Eiji Mizutani, Stuart Dreyfus
