Search Sciweavers | Sciweavers

22 search results - page 5 / 5

» Tighter Value Function Bounds for Bayesian Reinforcement Lea...

220

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

15 years 8 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

216

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 5 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers