Search Sciweavers | Sciweavers



EPIA
1995
Springer

110views Artificial Intelligence» more EPIA 1995»

Using Stochastic Grammars to Learn Robotic Tasks

15 years 10 months ago

Download welcome.isr.ist.utl.pt

Abstract. The paper introduces a reinforcement learning-based methodology for performance improvement of Intelligent Controllers. The translation interfaces of a 3-level Hierarchic...

Pedro U. Lima, George N. Saridis


NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 8 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore


ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 7 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...


JAIR
2000

131views more JAIR 2000»

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

15 years 6 months ago

Download www.jair.org

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...

Marilyn A. Walker


DATE
2000
IEEE

112views Hardware» more DATE 2000»

The Road to Better Reliability and Yield Embedded DfM Tools

15 years 11 months ago

Download www.date-conference.com

This paper gives an overview of the different tools needed for accomplishing optimal IC manufacturability and rapid technology learning during the successive phases of process ma...

Kees Veelenturf
