Search Sciweavers | Sciweavers


MICAI
2009
Springer

188 views, Artificial Intelligence

A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots

14 years 2 months ago

Reinforcement Learning is a commonly used technique in robotics; however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...

Julio H. Zaragoza, Eduardo F. Morales



AI
2002
Springer

171 views, Artificial Intelligence

Multiagent learning using a variable learning rate

13 years 7 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso



ICMLA
2010

211 views, Machine Learning

Ensembles of Neural Networks for Robust Reinforcement Learning

13 years 5 months ago

Download ahans.de

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their traini...

Alexander Hans, Steffen Udluft



NORDSEC
2009
Springer

160 views, Security Privacy

Towards Practical Enforcement Theories

14 years 2 months ago

Download disi.unitn.it

Runtime enforcement is a common mechanism for ensuring that program executions adhere to constraints specified by a security policy. It is based on two simple ideas: the enforceme...

Nataliia Bielova, Fabio Massacci, Andrea Michelett...



GECCO
2004
Springer

142 views, Optimization

Improving MACS Thanks to a Comparison with 2TBNs

14 years 1 month ago

Download www.cs.york.ac.uk

Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classifier Systems research. This framework is mostly used in the context ...

Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...

