Search Sciweavers | Sciweavers

4544 search results - page 38 / 909

» Reinforcement Learning with Time

JAIR
2008

148 views

Learning Partially Observable Deterministic Action Models

15 years 6 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

ICML
2006
IEEE

156 views, Machine Learning

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 6 months ago

Download animatlab.lip6.fr

Recent decision-theoretic planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (FMDPs). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

ABIALS
2008
Springer

281 views, Artificial Intelligence

Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

15 years 8 months ago

Download www.isir.upmc.fr

Factored Reinforcement Learning (FRL) is a new technique to solve Factored Markov Decision Problems (FMDPs) when the structure of the problem is not known in advance. Like Anticipa...

Olivier Sigaud, Martin V. Butz, Olga Kozlova, Chri...

ATAL
2004
Springer

221 views, Intelligent Agents

When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents

15 years 11 months ago

Download leibniz.cs.huji.ac.il

This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, specifically, genetic-learning-parentin...

Michael Berger, Jeffrey S. Rosenschein

NIPS
2008

130 views, Information Technology

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

15 years 7 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir
