Search Sciweavers | Sciweavers

4544 search results - page 152 / 909

» Reinforcement Learning with Time

118

Voted

ICRA
2009
IEEE

143views Robotics» more ICRA 2009»

Least absolute policy iteration for robust value function approximation

15 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efﬁciency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

claim paper

Read More »

148

Voted

COLING
2000

194views Computational Linguistics» more COLING 2000»

Automatic Optimization of Dialogue Management

15 years 4 months ago

Download www.cis.upenn.edu

Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...

Diane J. Litman, Michael S. Kearns, Satinder P. Si...

claim paper

Read More »

119

click to vote

ISCAS
2002
IEEE

153views Hardware» more ISCAS 2002»

Biological learning modeled in an adaptive floating-gate system

15 years 7 months ago

Download users.ece.gatech.edu

We have implemented an aspect of learning and memory in the nervous system using analog electronics. Using a simple synaptic circuit we realize networks with Hebbian type adaptati...

Christal Gordon, Paul E. Hasler

claim paper

Read More »

104

Voted

ATAL
2006
Springer

135views Intelligent Agents» more ATAL 2006»

Learning the required number of agents for complex tasks

15 years 6 months ago

Download www.damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa

claim paper

Read More »

124

Voted

AAAI
1994

185views Intelligent Agents» more AAAI 1994»

Learning to Coordinate without Sharing Information

15 years 4 months ago

Download www.agent.ai

Researchers in the eld of Distributed Arti cial Intelligence (DAI) have been developing e cient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale

claim paper

Read More »

« Prev « First page 152 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers