Search Sciweavers | Sciweavers

CDC
2010
IEEE

105 views, Control Systems

Learning in mean-field oscillator games

14 years 11 months ago

Download mechse.illinois.edu

This research concerns a noncooperative dynamic game with a large number of oscillators. The states are interpreted as the phase angles for a collection of non-homogeneous oscillator...

Huibing Yin, Prashant G. Mehta, Sean P. Meyn, Uday...

ICML
2003
IEEE

165 views, Machine Learning

The Cross Entropy Method for Fast Policy Search

16 years 5 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

GECCO
2000
Springer

121 views, Optimization

Metaphor for learning: an evolutionary algorithm

15 years 7 months ago

Download www.cslu.ogi.edu

The organizational algorithm is examined as a computational approach to representing interpersonal learning. The structure of the algorithm is introduced and described in context ...

Jody Lee House, Alexander Kain, James Hines

ICML
2006
IEEE

130 views, Machine Learning

Agnostic active learning

16 years 5 months ago

Download hunch.net

We state and analyze the first active learning algorithm which works in the presence of arbitrary forms of noise. The algorithm, A2 (for Agnostic Active), relies only upon the ass...

Maria-Florina Balcan, Alina Beygelzimer, John Lang...

ICML
1998
IEEE

268 views, Machine Learning

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 5 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich
