Search Sciweavers | Sciweavers

567 search results - page 48 / 114

» Regularized Policy Iteration

135

Voted

CCS
2005
ACM

105views Security Privacy» more CCS 2005»

Preventing attribute information leakage in automated trust negotiation

15 years 8 months ago

Download www4.ncsu.edu

Automated trust negotiation is an approach which establishes trust between strangers through the bilateral, iterative disclosure of digital credentials. Sensitive credentials are ...

Keith Irwin, Ting Yu

claim paper

Read More »

139

Voted

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 2 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

133

Voted

KDD
2009
ACM

228views Data Mining» more KDD 2009»

A generalized Co-HITS algorithm and its application to bipartite graphs

16 years 3 months ago

Download appsrv.cse.cuhk.edu.hk

Recently many data types arising from data mining and Web search applications can be modeled as bipartite graphs. Examples include queries and URLs in query logs, and authors and ...

Hongbo Deng, Michael R. Lyu, Irwin King

claim paper

Read More »

125

Voted

ISLPED
1999
ACM

91views Hardware» more ISLPED 1999»

Stochastic modeling of a power-managed system: construction and optimization

15 years 7 months ago

Download hydrogen.ws.binghamton.edu

-- The goal of a dynamic power management policy is to reduce the power consumption of an electronic system by putting system components into different states, each representing ce...

Qinru Qiu, Qing Wu, Massoud Pedram

claim paper

Read More »

121

Voted

ECAI
2006
Springer

194views Artificial Intelligence» more ECAI 2006»

Strategic Foresighted Learning in Competitive Multi-Agent Games

15 years 6 months ago

Download homepages.cwi.nl

We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive...

Pieter Jan't Hoen, Sander M. Bohte, Han La Poutr&e...

claim paper

Read More »

« Prev « First page 48 / 114 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers