Search Sciweavers | Sciweavers

87 search results - page 11 / 18

» Hybrid Least-Squares Algorithms for Approximate Policy Evalu...

click to vote

PROMAS
2004
Springer

189views Intelligent Agents» more PROMAS 2004»

Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach

14 years 1 months ago

Download teamcore.usc.edu

Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative f...

Ranjit Nair, Milind Tambe

claim paper

Read More »

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

14 years 6 hour ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

click to vote

GECCO
2007
Springer

235views Optimization» more GECCO 2007»

Expensive optimization, uncertain environment: an EA-based solution

14 years 2 months ago

Download www.cs.bham.ac.uk

Real life optimization problems often require finding optimal solution to complex high dimensional, multimodal problems involving computationally very expensive fitness function e...

Maumita Bhattacharya

claim paper

Read More »

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

13 years 9 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

click to vote

INFOCOM
2010
IEEE

180views Communications» more INFOCOM 2010»

Change Management in Enterprise IT Systems: Process Modeling and Capacity-optimal Scheduling

13 years 6 months ago

Download www.seas.upenn.edu

Abstract—We provide a formal model for the Change Management process for Enterprise IT systems, and develop change scheduling algorithms that seek to attain the “change capacit...

Praveen Kumar Muthuswamy, Koushik Kar, Sambit Sahu...

claim paper

Read More »

« Prev « First page 11 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers