Search Sciweavers | Sciweavers

268 search results - page 50 / 54

» Solving multiagent assignment Markov decision processes

177

click to vote

NIPS
2008

109views Information Technology» more NIPS 2008»

Biasing Approximate Dynamic Programming with a Lower Discount Factor

15 years 8 months ago

Download hal.inria.fr

Most algorithms for solving Markov decision processes rely on a discount factor, which ensures their convergence. It is generally assumed that using an artificially low discount f...

Marek Petrik, Bruno Scherrer

claim paper

Read More »

164

click to vote

NIPS
2007

146views Information Technology» more NIPS 2007»

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 8 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

168

click to vote

IJCAI
2003

137views Artificial Intelligence» more IJCAI 2003»

Approximating Optimal Policies for Agents with Limited Execution Resources

15 years 8 months ago

Download ai.stanford.edu

An agent with limited consumable execution resources needs policies that attempt to achieve good performance while respecting these limitations. Otherwise, an agent (such as a pla...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

198

click to vote

DSN
2009
IEEE

131views Computer Networks» more DSN 2009»

RRE: A game-theoretic intrusion Response and Recovery Engine

15 years 4 months ago

Download netfiles.uiuc.edu

Preserving the availability and integrity of networked computing systems in the face of fast-spreading intrusions requires advances not only in detection algorithms, but also in a...

Saman A. Zonouz, Himanshu Khurana, William H. Sand...

claim paper

Read More »

221

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 4 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

« Prev « First page 50 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers