Search Sciweavers | Sciweavers


ICML
2002
IEEE

Machine Learning

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 4 months ago

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan


DATE
2007
IEEE

Hardware

Temperature aware task scheduling in MPSoCs

15 years 10 months ago

Download www.date-conference.com

In deep submicron circuits, elevation in temperatures has brought new challenges in reliability, timing, performance, cooling costs and leakage power. Conventional thermal managem...

Ayse Kivilcim Coskun, Tajana Simunic Rosing, Keith...


IASTEDSE
2004

Software Engineering

An authorization and access control scheme for pervasive computing

15 years 5 months ago

Download www.nokia.com

The existence of a central security authority is too restrictive for pervasive computing environments. Existing distributed security schemes fail in a pervasive computing environm...

Linda Staffans, Titos Saridakis


GECCO
2009
Springer

Optimization

Uncertainty handling CMA-ES for reinforcement learning

15 years 1 month ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel


CDC
2010
IEEE

Control Systems

Pathologies of temporal difference methods in approximate dynamic programming

14 years 10 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas
