Search Sciweavers | Sciweavers

334 search results - page 55 / 67

» How to Dynamically Merge Markov Decision Processes

230

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 11 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

189

Voted

EOR
2006

66views more EOR 2006»

Performance prediction of an unmanned airborne vehicle multi-agent system

15 years 7 months ago

Download www.damas.ift.ulaval.ca

Consider unmanned airborne vehicle (UAV) control agents in a dynamic multi-agent system. The agents must have a set of goals such as destination airport and intermediate positions...

Zhaotong Lian, Abhijit Deshmukh

claim paper

Read More »

234

click to vote

ISCA
2009
IEEE

318views Hardware» more ISCA 2009»

Thread criticality predictors for dynamic performance, power, and resource management in chip multiprocessors

16 years 2 months ago

Download www.princeton.edu

With the shift towards chip multiprocessors (CMPs), exploiting and managing parallelism has become a central problem in computer systems. Many issues of parallelism management boi...

Abhishek Bhattacharjee, Margaret Martonosi

claim paper

Read More »

180

click to vote

SAC
2010
ACM

199views Applied Computing» more SAC 2010»

MetaSelf: an architecture and a development method for dependable self-* systems

16 years 2 months ago

Download www.dcs.bbk.ac.uk

This paper proposes a software architecture and a development process for engineering dependable and controllable self-organising (SO) systems. Our approach addresses dependabilit...

Giovanna Di Marzo Serugendo, John S. Fitzgerald, A...

claim paper

Read More »

206

click to vote

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

16 years 8 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

« Prev « First page 55 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers