Search Sciweavers | Sciweavers

3412 search results - page 13 / 683

» Efficient Reinforcement Learning

216

click to vote

SCAI
2008

246views Artificial Intelligence» more SCAI 2008»

Fast Learning in an Actor-Critic Architecture with Reward and Punishment

15 years 8 months ago

Download www.lucs.lu.se

Abstract. A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action as...

Christian Balkenius, Stefan Winberg

claim paper

Read More »

196

click to vote

ICTAI
2007
IEEE

167views Artificial Intelligence» more ICTAI 2007»

Multi-agent Reinforcement Learning Using Strategies and Voting

16 years 1 months ago

Download users.auth.gr

Multiagent learning attracts much attention in the past few years as it poses very challenging problems. Reinforcement Learning is an appealing solution to the problems that arise...

Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...

claim paper

Read More »

195

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 4 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

209

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

16 years 7 days ago

Download k2x.ice.ous.ac.jp

This paper presents a novel model of reinforcement learning agents. A feature of our learning agent model is to integrate analytic hierarchy process (AHP) into a standard reinforc...

Kengo Katayama, Takahiro Koshiishi, Hiroyuki Narih...

claim paper

Read More »

158

Voted

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

16 years 7 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

« Prev « First page 13 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers