Search Sciweavers | Sciweavers

1233 search results - page 110 / 247

» Feudal Reinforcement Learning

155

Voted

ISNN
2007
Springer

116views Neural Networks» more ISNN 2007»

Online Dynamic Value System for Machine Learning

16 years 8 days ago

Download www.ent.ohiou.edu

A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...

Haibo He, Janusz A. Starzyk

claim paper

Read More »

122

click to vote

ATAL
2004
Springer

221views Intelligent Agents» more ATAL 2004»

When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents

15 years 11 months ago

Download leibniz.cs.huji.ac.il

This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, speciﬁcally, geneticlearning-parentin...

Michael Berger, Jeffrey S. Rosenschein

claim paper

Read More »

157

Voted

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 5 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

185

click to vote

MAGS
2010

81views more MAGS 2010»

Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation

15 years 28 days ago

Download damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa, Patric...

claim paper

Read More »

155

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

16 years 7 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

« Prev « First page 110 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers