Search Sciweavers | Sciweavers

332 search results - page 13 / 67

» Ranking policies in discrete Markov decision processes

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

14 years 2 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

ATAL
2009
Springer

155 views, Intelligent Agents

Planning with continuous resources for agent teams

14 years 2 months ago

Download www.aamas-conference.org

Many problems of multiagent planning under uncertainty require distributed reasoning with continuous resources and resource limits. Decentralized Markov Decision Problems (Dec-MDP...

Janusz Marecki, Milind Tambe

DAC
2000
ACM

179 views, Computer Architecture

Dynamic power management of complex systems using generalized stochastic Petri nets

14 years 9 months ago

Download atrak.usc.edu

In this paper, we introduce a new technique for modeling and solving the dynamic power management (DPM) problem for systems with complex behavioral characteristics such as concurr...

Qinru Qiu, Qing Wu, Massoud Pedram

IJCAI
2003

173 views, Artificial Intelligence

A Planning Algorithm for Predictive State Representations

13 years 9 months ago

Download dli.iiit.ac.in

We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...

Masoumeh T. Izadi, Doina Precup

ATAL
2008
Springer

116 views, Intelligent Agents

Controlling deliberation in a Markov decision process-based agent

13 years 10 months ago

Download coitweb.uncc.edu

Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...

George Alexander, Anita Raja, David J. Musliner
