Search Sciweavers | Sciweavers

85 search results - page 8 / 17

» Approximate Policy Iteration with a Policy Language Bias

click to vote

AAAI
2006

157views Intelligent Agents» more AAAI 2006»

Improving Approximate Value Iteration Using Memories and Predictive State Representations

13 years 8 months ago

Download www.aaai.org

Planning in partially-observable dynamical systems is a challenging problem, and recent developments in point-based techniques such as Perseus significantly improve performance as...

Michael R. James, Ton Wessling, Nikos A. Vlassis

claim paper

Read More »

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

13 years 7 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

click to vote

WMCSA
2002
IEEE

94views Communications» more WMCSA 2002»

Extensible Adaptation via Constraint Solving

14 years 8 days ago

Download infoscience.epfl.ch

Applications running on a mobile and wireless devices must be able to adapt gracefully to limited and ﬂuctuating network resources. The variety of applications, platforms upon w...

Yuri Dotsenko, Eyal de Lara, Dan S. Wallach, Willy...

claim paper

Read More »

click to vote

CCS
2011
ACM

193views Security Privacy» more CCS 2011»

Policy auditing over incomplete logs: theory, implementation and applications

12 years 7 months ago

Download www.cs.cmu.edu

We present the design, implementation and evaluation of an algorithm that checks audit logs for compliance with privacy and security policies. The algorithm, which we name reduce,...

Deepak Garg, Limin Jia, Anupam Datta

claim paper

Read More »

click to vote

RSS
2007

176views Robotics» more RSS 2007»

Active Policy Learning for Robot Planning and Exploration under Uncertainty

13 years 8 months ago

Download www.roboticsproceedings.org

Abstract— This paper proposes a simulation-based active policy learning algorithm for ﬁnite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...

claim paper

Read More »

« Prev « First page 8 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers