Search Sciweavers | Sciweavers

87 search results - page 9 / 18

» Hybrid Least-Squares Algorithms for Approximate Policy Evalu...

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

14 years 1 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

click to vote

ISSAC
2007
Springer

153views Mathematics» more ISSAC 2007»

On exact and approximate interpolation of sparse rational functions

14 years 2 months ago

Download www4.ncsu.edu

The black box algorithm for separating the numerator from the denominator of a multivariate rational function can be combined with sparse multivariate polynomial interpolation alg...

Erich Kaltofen, Zhengfeng Yang

claim paper

Read More »

click to vote

JAIR
2008

126views more JAIR 2008»

Optimal and Approximate Q-value Functions for Decentralized POMDPs

13 years 7 months ago

Download www.jair.org

Decision-theoretic planning is a popular approach to sequential decision making problems, because it treats uncertainty in sensing and acting in a principled way. In single-agent ...

Frans A. Oliehoek, Matthijs T. J. Spaan, Nikos A. ...

claim paper

Read More »

click to vote

IEEECIT
2010
IEEE

105views Information Technology» more IEEECIT 2010»

Predictive and Dynamic Resource Allocation for Enterprise Applications

13 years 6 months ago

Download www2.warwick.ac.uk

—Dynamic resource allocation has the potential to provide signiﬁcant increases in total revenue in enterprise systems through the reallocation of available resources as the dem...

M. Al-Ghamdi, Adam P. Chester, Stephen A. Jarvis

claim paper

Read More »

click to vote

ATAL
2007
Springer

141views Intelligent Agents» more ATAL 2007»

Commitment-driven distributed joint policy search

14 years 2 months ago

Download www-personal.umich.edu

Decentralized MDPs provide powerful models of interactions in multi-agent environments, but are often very diﬃcult or even computationally infeasible to solve optimally. Here we...

Stefan J. Witwicki, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 9 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers