Search Sciweavers | Sciweavers

4544 search results - page 180 / 909

» Reinforcement Learning with Time

157

click to vote

ICONIP
2007

103views Information Technology» more ICONIP 2007»

Practical Recurrent Learning (PRL) in the Discrete Time Domain

15 years 8 months ago

Download shws.cc.oita-u.ac.jp

One of the authors has proposed a simple learning algorithm for recurrent neural networks, which requires computational cost and memory capacity in practical order O(n2 )[1]. The a...

Mohamad Faizal Bin Samsudin, Takeshi Hirose, Katsu...

claim paper

Read More »

212

click to vote

AGENTS
2000
Springer

119views Security Privacy» more AGENTS 2000»

Adaptivity in agent-based routing for data networks

15 years 11 months ago

Download web.engr.oregonstate.edu

Adaptivity, both of the individual agents and of the interaction structure among the agents, seems indispensable for scaling up multi-agent systems MAS's in noisy environme...

David Wolpert, Sergey Kirshner, Christopher J. Mer...

claim paper

Read More »

188

click to vote

ROBOCUP
2000
Springer

104views Robotics» more ROBOCUP 2000»

Essex Wizards 2000 Team Description

15 years 11 months ago

Download cswww.essex.ac.uk

: This article gives an overview of the Essex Wizards 2000 team participated in the RoboCup 2000 simulator league. A brief description of the agent architecture for the team is int...

Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Kos...

claim paper

Read More »

194

click to vote

ESANN
2008

115views Neural Networks» more ESANN 2008»

15 years 8 months ago

Similarities and differences between policy gradient methods and evolution strategies

Download www.dice.ucl.ac.be

Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

189

click to vote

NIPS
2007

80views Information Technology» more NIPS 2007»

Stable Dual Dynamic Programming

15 years 8 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

« Prev « First page 180 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers