Search Sciweavers | Sciweavers

1235 search results - page 138 / 247

» Reinforcement learning in a nutshell

154

Voted

COLING
2000

194views Computational Linguistics» more COLING 2000»

Automatic Optimization of Dialogue Management

15 years 4 months ago

Download www.cis.upenn.edu

Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...

Diane J. Litman, Michael S. Kearns, Satinder P. Si...

claim paper

Read More »

110

Voted

ATAL
2006
Springer

135views Intelligent Agents» more ATAL 2006»

Learning the required number of agents for complex tasks

15 years 7 months ago

Download www.damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa

claim paper

Read More »

133

click to vote

AAAI
1994

185views Intelligent Agents» more AAAI 1994»

Learning to Coordinate without Sharing Information

15 years 4 months ago

Download www.agent.ai

Researchers in the eld of Distributed Arti cial Intelligence (DAI) have been developing e cient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale

claim paper

Read More »

174

Voted

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 7 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

133

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 4 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

« Prev « First page 138 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers