Search Sciweavers | Sciweavers

200 search results - page 23 / 40

» Point-Based Policy Iteration


PERCOM 2007, ACM (Computer Networks)

Sensor Scheduling for Optimal Observability Using Estimation Entropy

16 years 6 months ago

Download people.eng.unimelb.edu.au

We consider sensor scheduling as the optimal observability problem for partially observable Markov decision processes (POMDPs). This model fits cases where a Markov process ...

Mohammad Rezaeian


SOUPS 2009, ACM (Security Privacy)

A "nutrition label" for privacy

16 years 1 month ago

Download cups.cs.cmu.edu

We used an iterative design process to develop a privacy label that presents to consumers the ways organizations collect, use, and share personal information. Many surveys have sh...

Patrick Gage Kelley, Joanna Bresee, Lorrie Faith C...


ICML 2001, IEEE (Machine Learning)

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 7 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch


ICML 1996, IEEE (Machine Learning)

Discretizing Continuous Attributes While Learning Bayesian Networks

16 years 7 months ago

Download www.cs.huji.ac.il

We introduce a method for learning Bayesian networks that handles the discretization of continuous variables as an integral part of the learning process. The main ingredient in th...

Moisés Goldszmidt, Nir Friedman


IJCNN 2008, IEEE (Neural Networks)

Uncertainty propagation for quality assurance in Reinforcement Learning

16 years 1 month ago

Download www.inb.uni-luebeck.de

In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...
