Search Sciweavers | Sciweavers

181 search results - page 11 / 37

» On Policy Learning in Restricted Policy Spaces

click to vote

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

13 years 10 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

click to vote

CCS
2011
ACM

193views Security Privacy» more CCS 2011»

Policy auditing over incomplete logs: theory, implementation and applications

12 years 7 months ago

Download www.cs.cmu.edu

We present the design, implementation and evaluation of an algorithm that checks audit logs for compliance with privacy and security policies. The algorithm, which we name reduce,...

Deepak Garg, Limin Jia, Anupam Datta

claim paper

Read More »

click to vote

CCS
2010
ACM

190views Security Privacy» more CCS 2010»

Adjustable autonomy for cross-domain entitlement decisions

13 years 5 months ago

Download www.dist-systems.bbn.com

Cross-domain information exchange is a growing problem, as business and governmental organizations increasingly need to integrate their information systems with those of partially...

Jacob Beal, Jonathan Webb, Michael Atighetchi

claim paper

Read More »

click to vote

MICAI
2009
Springer

188views Artificial Intelligence» more MICAI 2009»

A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots

14 years 2 months ago

Download ccc.inaoep.mx

Reinforcement Learning is a commonly used technique in robotics, however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...

Julio H. Zaragoza, Eduardo F. Morales

claim paper

Read More »

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

14 years 2 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

« Prev « First page 11 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers