Search Sciweavers | Sciweavers

85 search results - page 12 / 17

» Approximate Policy Iteration with a Policy Language Bias

click to vote

SOUPS
2009
ACM

137views Security Privacy» more SOUPS 2009»

A "nutrition label" for privacy

14 years 1 months ago

Download cups.cs.cmu.edu

We used an iterative design process to develop a privacy label that presents to consumers the ways organizations collect, use, and share personal information. Many surveys have sh...

Patrick Gage Kelley, Joanna Bresee, Lorrie Faith C...

claim paper

Read More »

click to vote

IEEEPACT
2009
IEEE

184views Distributed And Parallel Com...» more IEEEPACT 2009»

Using Aggressor Thread Information to Improve Shared Cache Management for CMPs

14 years 2 months ago

Download maggini.eng.umd.edu

—Shared cache allocation policies play an important role in determining CMP performance. The simplest policy, LRU, allocates cache implicitly as a consequence of its replacement ...

Wanli Liu, Donald Yeung

claim paper

Read More »

click to vote

AAMAS
2010
Springer

158views Intelligent Agents» more AAMAS 2010»

Coordinated learning in multiagent MDPs with infinite state-space

13 years 7 months ago

Download gaips.inesc-id.pt

Abstract In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

claim paper

Read More »

click to vote

ECML
2004
Springer

112views Machine Learning» more ECML 2004»

Convergence and Divergence in Standard and Averaging Reinforcement Learning

14 years 22 days ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

claim paper

Read More »

click to vote

UAI
2004

121views Artificial Intelligence» more UAI 2004»

Discretized Approximations for POMDP with Average Cost

13 years 8 months ago

Download web.mit.edu

In this paper, we propose a new lower approximation scheme for POMDP with discounted and average cost criterion. The approximating functions are determined by their values at a fi...

Huizhen Yu, Dimitri P. Bertsekas

claim paper

Read More »

« Prev « First page 12 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers