Search Sciweavers | Sciweavers

AAAI
2007

Intelligent Agents

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison

15 years 9 months ago

Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

TVLSI
2008

Static and Dynamic Temperature-Aware Scheduling for Multiprocessor SoCs

15 years 6 months ago

Download www.bu.edu

Thermal hot spots and high temperature gradients degrade reliability and performance, and increase cooling costs and leakage power. In this paper, we explore the benefits of temper...

Ayse Kivilcim Coskun, T. T. Rosing, Keith Whisnant...

CHIMIT
2008
ACM

Human Computer Interaction

Policy-based IT automation: the role of human judgment

15 years 8 months ago

Download www.plunk.org

Policy-based automation is emerging as a viable approach to IT systems management, codifying high-level business goals into executable specifications for governing IT operations. ...

Eser Kandogan, John H. Bailey, Paul P. Maglio, Ebe...

TON
2010

Throughput Optimal Distributed Power Control of Stochastic Wireless Networks

15 years 1 months ago

Download pantheon.yale.edu

The Maximum Differential Backlog (MDB) control policy of Tassiulas and Ephremides has been shown to adaptively maximize the stable throughput of multihop wireless networks with ran...

Yufang Xi, Edmund M. Yeh

ICRA
2010
IEEE

Robotics

A simple learning strategy for high-speed quadrocopter multi-flips

15 years 5 months ago

Download www.idsc.ethz.ch

We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-flips by combining iterative experiments with information from a first...

Sergei Lupashin, Angela Schöllig, Michael She...
