Search Sciweavers | Sciweavers

4544 search results - page 71 / 909

» Reinforcement Learning with Time

152

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

15 years 7 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

137

click to vote

FLAIRS
1998

90views Artificial Intelligence» more FLAIRS 1998»

Optimizing Production Manufacturing Using Reinforcement Learning

15 years 7 months ago

Download www.aaai.org

Manyindustrial processes involve makingparts with an assemblyof machines, where each machinecarries out an operation on a part, and the finished product requires a wholeseries of ...

Sridhar Mahadevan, Georgios Theocharous

claim paper

Read More »

176

click to vote

JAIR
2002

99views more JAIR 2002»

Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System

15 years 5 months ago

Download www.eecs.umich.edu

Designing the dialogue policy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing a di...

Satinder P. Singh, Diane J. Litman, Michael J. Kea...

claim paper

Read More »

189

click to vote

INLG
2010
Springer

134views Natural Language Processing» more INLG 2010»

Hierarchical Reinforcement Learning for Adaptive Text Generation

15 years 4 months ago

Download www.aclweb.org

We present a novel approach to natural language generation (NLG) that applies hierarchical reinforcement learning to text generation in the wayfinding domain. Our approach aims to...

Nina Dethlefs, Heriberto Cuayáhuitl

claim paper

Read More »

358

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

14 years 4 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 71 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers