Search Sciweavers | Sciweavers

20

SMC
2007
IEEE

102views Control Systems» more SMC 2007»

14 years 3 months ago

—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

claim paper

Read More »

31

click to vote

ICCBR
2003
Springer

114views Automated Reasoning» more ICCBR 2003»

Evaluation of Case-Based Maintenance Strategies in Software Design

14 years 2 months ago

Download cisucpt.dei.uc.pt

CBR applications running in real domains can easily reach thousands of cases, which are stored in the case library. Retrieval times can increase greatly if the retrieval algorithm ...

Paulo Gomes, Francisco C. Pereira, Paulo Paiva, Nu...

claim paper

Read More »

32

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

14 years 20 days ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

31

click to vote

USENIX
2003

116views Operating System» more USENIX 2003»

CUP: Controlled Update Propagation in Peer-to-Peer Networks

13 years 10 months ago

Download www.eecs.harvard.edu

— This paper proposes CUP, a protocol for performing Controlled Update Propagation to maintain caches of metadata in peer-to-peer networks. To moderate propagation without imposi...

Mema Roussopoulos, Mary Baker

claim paper

Read More »

30

click to vote

FGCS
2007

99views more FGCS 2007»

Mining performance data for metascheduling decision support in the Grid

13 years 9 months ago

Download www.liacs.nl

: Metaschedulers in the Grid needs dynamic information to support their scheduling decisions. Job response time on computing resources, for instance, is such a performance metric. ...

Hui Li, David L. Groep, Lex Wolters

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers