Search Sciweavers | Sciweavers

69 search results - page 8 / 14

» PAC-Bayesian Policy Evaluation for Reinforcement Learning

179

click to vote

ICAC
2009
IEEE

226views Applied Computing» more ICAC 2009»

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

15 years 3 months ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

claim paper

Read More »

154

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 11 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

187

click to vote

ICML
2001
IEEE

159views Machine Learning» more ICML 2001»

Direct Policy Search using Paired Statistical Tests

16 years 6 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

claim paper

Read More »

216

click to vote

BERTINORO
2005
Springer

175views Information Technology» more BERTINORO 2005»

Emergent Consensus in Decentralised Systems Using Collaborative Reinforcement Learning

15 years 11 months ago

Download www.scss.tcd.ie

Abstract. This paper describes the application of a decentralised coordination algorithm, called Collaborative Reinforcement Learning (CRL), to two diﬀerent distributed system pr...

Jim Dowling, Raymond Cunningham, Anthony Harringto...

claim paper

Read More »

163

click to vote

AAAI
2007

122views Intelligent Agents» more AAAI 2007»

RETALIATE: Learning Winning Policies in First-Person Shooter Games

15 years 7 months ago

Download www.cse.lehigh.edu

In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team firstperson shooter games. RETALIATE has three crucial chara...

Megan Smith, Stephen Lee-Urban, Hector Muño...

claim paper

Read More »

« Prev « First page 8 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers