Search Sciweavers | Sciweavers

688 search results - page 76 / 138

» Using reinforcement learning to adapt an imitation task

194

click to vote

JAIR
2008

148views more JAIR 2008»

Learning Partially Observable Deterministic Action Models

15 years 5 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

claim paper

Read More »

136

click to vote

NIPS
2004

92views Information Technology» more NIPS 2004»

Responding to Modalities with Different Latencies

15 years 6 months ago

Download books.nips.cc

Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...

Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...

claim paper

Read More »

152

Voted

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 8 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

147

click to vote

AAAI
2000

104views Intelligent Agents» more AAAI 2000»

Inter-Layer Learning Towards Emergent Cooperative Behavior

15 years 6 months ago

Download www.cs.cmu.edu

As applications for artificially intelligent agents increase in complexity we can no longer rely on clever heuristics and hand-tuned behaviors to develop their programming. Even t...

Shawn Arseneau, Wei Sun, Changpeng Zhao, Jeremy R....

claim paper

Read More »

159

click to vote

FLAIRS
2010

148views Artificial Intelligence» more FLAIRS 2010»

Decision-Theoretic Simulated Annealing

15 years 3 months ago

Download cs.gettysburg.edu

The choice of a good annealing schedule is necessary for good performance of simulated annealing for combinatorial optimization problems. In this paper, we pose the simulated anne...

Todd W. Neller, Christopher J. La Pilla

claim paper

Read More »

« Prev « First page 76 / 138 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers