Search Sciweavers | Sciweavers

850 search results - page 83 / 170

» Using Machine Learning to Guide Architecture Simulation


ICML
1995
IEEE

Machine Learning

Residual Algorithms: Reinforcement Learning with Function Approximation

14 years 9 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III



ICML
2006
IEEE

Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

14 years 9 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...



MINENET
2005
ACM

Computer Networks

ACAS: automated construction of application signatures

14 years 1 month ago

Download conferences.sigcomm.org

An accurate mapping of traffic to applications is important for a broad range of network management and measurement tasks. Internet applications have traditionally been identifi...

Patrick Haffner, Subhabrata Sen, Oliver Spatscheck...



DAC
1994
ACM

Computer Architecture

Automatic Verification of Pipelined Microprocessors

14 years 6 days ago

Download www.cs.york.ac.uk

Abstract - We address the problem of automatically verifying large digital designs at the logic level, against high-level specifications. In this paper, we present a methodology wh...

Vishal Bhagwati, Srinivas Devadas



ICML
2007
IEEE

Machine Learning

Combining online and offline knowledge in UCT

14 years 9 months ago

Download www.machinelearning.org

The UCT algorithm learns a value function online using sample-based search. The TD(λ) algorithm can learn a value function offline for the on-policy distribution. We consider three...

Sylvain Gelly, David Silver

