Search Sciweavers | Sciweavers

200 search results - page 20 / 40

» Point-Based Policy Iteration

198

click to vote

CIA
2007
Springer

143views Intelligent Agents» more CIA 2007»

Multi-agent Learning Dynamics: A Survey

16 years 27 days ago

Download michaelkaisers.com

Abstract. In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...

H. Jaap van den Herik, Daniel Hennes, Michael Kais...

claim paper

Read More »

225

click to vote

ICTAI
2006
IEEE

110views Artificial Intelligence» more ICTAI 2006»

A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem

16 years 22 days ago

Download www.loria.fr

We propose a novel algorithm called GA-MDP for solving the frequency assigment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...

Lhassane Idoumghar, René Schott

claim paper

Read More »

176

click to vote

CORR
2008
Springer

132views Education» more CORR 2008»

Dynamic Rate Allocation in Fading Multiple-access Channels

15 years 6 months ago

Download web.mit.edu

We consider the problem of rate allocation in a fading Gaussian multiple-access channel (MAC) with fixed transmission powers. Our goal is to maximize a general concave utility func...

Ali ParandehGheibi, Atilla Eryilmaz, Asuman E. Ozd...

claim paper

Read More »

190

click to vote

ICMLA
2008

195views Machine Learning» more ICMLA 2008»

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 8 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

claim paper

Read More »

163

click to vote

AIPS
2004

82views Artificial Intelligence» more AIPS 2004»

Learning Domain-Specific Control Knowledge from Random Walks

15 years 8 months ago

Download www2.parc.com

We describe and evaluate a system for learning domainspecific control knowledge. In particular, given a planning domain, the goal is to output a control policy that performs well ...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

« Prev « First page 20 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers