Search Sciweavers | Sciweavers

87 search results - page 7 / 18

» A policy iteration algorithm for Markov decision processes s...

ICTAI
2006
IEEE

110 views, Artificial Intelligence

A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem

14 years 1 month ago

Download www.loria.fr

We propose a novel algorithm called GA-MDP for solving the frequency assignment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...

Lhassane Idoumghar, René Schott


ICML
1994
IEEE

152 views, Machine Learning

Markov Games as a Framework for Multi-Agent Reinforcement Learning

13 years 11 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman


IJCAI
2003

173 views, Artificial Intelligence

A Planning Algorithm for Predictive State Representations

13 years 8 months ago

Download dli.iiit.ac.in

We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...

Masoumeh T. Izadi, Doina Precup


GECCO
2004
Springer

142 views, Optimization

Improving MACS Thanks to a Comparison with 2TBNs

14 years 26 days ago

Download www.cs.york.ac.uk

Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classifier Systems research. This framework is mostly used in the context ...

Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...


JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

13 years 2 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
