Search Sciweavers | Sciweavers

226 search results - page 33 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

13 years 8 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

click to vote

ATAL
2006
Springer

177views Intelligent Agents» more ATAL 2006»

Convergence analysis for collective vocabulary development

13 years 11 months ago

Download www.isrl.illinois.edu

We study how decentralized agents can develop a shared vocabulary without global coordination. Answering this question can help us understand the emergence of many communication s...

Jun Wang, Les Gasser, Jim Houk

claim paper

Read More »

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

13 years 11 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

click to vote

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

13 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

click to vote

ACL
1998

129views Computational Linguistics» more ACL 1998»

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email

13 years 8 months ago

Download acl.eldoc.ub.rug.nl

This paper describes a novel method by which a dialogue agent can learn to choose an optimal dialogue strategy. While it is widely agreed that dialogue strategies should be formul...

Marilyn A. Walker, Jeanne Frommer, Shrikanth Naray...

claim paper

Read More »

« Prev « First page 33 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers