Search Sciweavers | Sciweavers

827 search results - page 33 / 166

» Variational methods for Reinforcement Learning

166

click to vote

ECML
2004
Springer

139views Machine Learning» more ECML 2004»

Batch Reinforcement Learning with State Importance

16 years 7 days ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classiﬁer mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

194

click to vote

ICONIP
2007

141views Information Technology» more ICONIP 2007»

Natural Conjugate Gradient in Variational Inference

15 years 8 months ago

Download eprints.pascal-network.org

Variational methods for approximate inference in machine learning often adapt a parametric probability distribution to optimize a given objective function. This view is especially ...

Antti Honkela, Matti Tornio, Tapani Raiko, Juha Ka...

claim paper

Read More »

189

click to vote

SPEECH
2008

114views more SPEECH 2008»

A Reinforcement Learning approach to evaluating state representations in spoken dialogue systems

15 years 6 months ago

Download www.lrdc.pitt.edu

Although dialogue systems have been an area of research for decades, finding accurate ways of evaluating different systems is still a very active subfield since many leading metho...

Joel R. Tetreault, Diane J. Litman

claim paper

Read More »

175

click to vote

IROS
2007
IEEE

157views Robotics» more IROS 2007»

Autonomous blimp control using model-free reinforcement learning in a continuous state and action space

16 years 1 months ago

Download www.informatik.uni-freiburg.de

— In this paper, we present an approach that applies the reinforcement learning principle to the problem of learning height control policies for aerial blimps. In contrast to pre...

Axel Rottmann, Christian Plagemann, Peter Hilgers,...

claim paper

Read More »

196

click to vote

IAT
2008
IEEE

161views Intelligent Agents» more IAT 2008»

Scaling Up Multi-agent Reinforcement Learning in Complex Domains

15 years 7 months ago

Download www3.ntu.edu.sg

TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...

Dan Xiao, Ah-Hwee Tan

claim paper

Read More »

« Prev « First page 33 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers