Search Sciweavers | Sciweavers

128 search results - page 20 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

COLT
2007
Springer

103 views, Machine Learning

Strategies for Prediction Under Imperfect Monitoring

14 years 1 month ago

Download eprints.pascal-network.org

Abstract. We propose simple randomized strategies for sequential prediction under imperfect monitoring, that is, when the forecaster does not have access to the past outcomes but r...

Gábor Lugosi, Shie Mannor, Gilles Stoltz

AGENTS
1999
Springer

126 views, Security Privacy

General Principles of Learning-Based Multi-Agent Systems

13 years 12 months ago

Download web.engr.oregonstate.edu

We consider the problem of how to design large decentralized multiagent systems (MAS’s) in an automated fashion, with little or no hand-tuning. Our approach has each agent run a...

David Wolpert, Kevin R. Wheeler, Kagan Tumer

SIAMCO
2000

117 views

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

13 years 7 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

GECCO
2005
Springer

155 views, Optimization

Co-evolving recurrent neurons learn deep memory POMDPs

14 years 1 month ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber

ATAL
2010
Springer

146 views, Intelligent Agents

PAC-MDP learning with knowledge-based admissible models

13 years 7 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko
