Search Sciweavers | Sciweavers

3381 search results - page 314 / 677

» LEO - DB2's LEarning Optimizer

154

Voted

ICML
2002
IEEE

127views Machine Learning» more ICML 2002»

Action Refinement in Reinforcement Learning by Probability Smoothing

16 years 7 months ago

Download www.cs.berkeley.edu

In many reinforcement learning applications, the set of possible actions can be partitioned by the programmer into subsets of similar actions. This paper presents a technique for ...

Carles Sierra, Dídac Busquets, Ramon L&oacu...

claim paper

Read More »

179

click to vote

ALT
2005
Springer

137views Machine Learning» more ALT 2005»

Defensive Universal Learning with Experts

16 years 3 months ago

Download www.idsia.ch

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...

Jan Poland, Marcus Hutter

claim paper

Read More »

140

click to vote

PKDD
2009
Springer

117views Data Mining» more PKDD 2009»

New Regularized Algorithms for Transductive Learning

16 years 20 days ago

Download www.cis.upenn.edu

Abstract. We propose a new graph-based label propagation algorithm for transductive learning. Each example is associated with a vertex in an undirected graph and a weighted edge be...

Partha Pratim Talukdar, Koby Crammer

claim paper

Read More »

181

Voted

ATAL
2007
Springer

122views Intelligent Agents» more ATAL 2007»

Reducing the complexity of multiagent reinforcement learning

16 years 9 days ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

143

click to vote

GECCO
2007
Springer

176views Optimization» more GECCO 2007»

The effect of learning on life history evolution

16 years 9 days ago

Download www.cs.bham.ac.uk

A series of evolutionary neural network simulations are presented which explore the hypothesis that learning factors can result in the evolution of long periods of parental protec...

John A. Bullinaria

claim paper

Read More »

« Prev « First page 314 / 677 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers