Search Sciweavers | Sciweavers

152

NIPS
2007

158views Information Technology» more NIPS 2007»

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 6 months ago

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

claim paper

Read More »

152

click to vote

SEC
2007

141views Security Privacy» more SEC 2007»

Extending Role Based Access Control Model for Distributed Multidomain Applications

15 years 6 months ago

Download www.uazone.org

This paper presents the results related to the development of a flexible domain-based access control infrastructure for distributed Grid-based Collaborative Environments and Comple...

Yuri Demchenko, Leon Gommans, Cees de Laat

claim paper

Read More »

141

click to vote

IJCAI
2001

174views Artificial Intelligence» more IJCAI 2001»

Complexity of Probabilistic Planning under Average Rewards

15 years 5 months ago

Download www.informatik.uni-freiburg.de

A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...

Jussi Rintanen

claim paper

Read More »

150

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 5 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

131

click to vote

IM
1997

105views Computer Networks» more IM 1997»

The Hollowman: An Innovative ATM Control Architecture

15 years 5 months ago

Download reference.kfupm.edu.sa

The current implementation of out-of-band control in ATM networks inhibits their successful exploitation. The confusion in signalling protocols between application services and th...

Sean Rooney

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers