Search Sciweavers | Sciweavers

84 search results - page 8 / 17

» Integrating Reinforcement Learning, Bidding and Genetic Algo...

156

click to vote

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

15 years 11 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classiﬁer System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

claim paper

Read More »

172

click to vote

AGI
2008

136views Artificial Intelligence» more AGI 2008»

An Integrative Methodology for Teaching Embodied Non-Linguistic Agents, Applied to Virtual Animals in Second Life

15 years 7 months ago

Download www.novamente.net

A teaching methodology called Imitative-Reinforcement-Corrective (IRC) learning is described, and proposed as a general approach for teaching embodied non-linguistic AGI systems. I...

Ben Goertzel, Cassio Pennachin, Nil Geisweiller, M...

claim paper

Read More »

254

click to vote

ATAL
2008
Springer

136views Intelligent Agents» more ATAL 2008»

Efficient multi-agent reinforcement learning through automated supervision

15 years 8 months ago

Download www.cs.umass.edu

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop a supervision fr...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

174

click to vote

HIS
2008

122views Information Technology» more HIS 2008»

New Crossover Operator for Evolutionary Rule Discovery in XCS

15 years 7 months ago

Download www.salle.url.edu

XCS is a learning classifier system that combines a reinforcement learning scheme with evolutionary algorithms to evolve rule sets on-line by means of the interaction with an envi...

Sergio Morales-Ortigosa, Albert Orriols-Puig, Este...

claim paper

Read More »

152

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 8 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers