Search Sciweavers | Sciweavers

92 search results - page 11 / 19

» A General Convergence Method for Reinforcement Learning in t...

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

13 years 8 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be effective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

ALT
2003
Springer

133views Machine Learning» more ALT 2003»

On the Existence and Convergence of Computable Universal Priors

14 years 3 months ago

Download www.hutter1.net

Solomonoff unified Occam's razor and Epicurus' principle of multiple explanations to one elegant, formal, universal theory of inductive inference, which initiated the field...

Marcus Hutter

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

14 years 1 months ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

14 years 7 days ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

ATAL
2007
Springer

143views Intelligent Agents» more ATAL 2007»

On discovery and learning of models with predictive representations of state for agents with continuous actions and observations

13 years 10 months ago

Download web.mit.edu

Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on the case of discrete observations and actions. The theory of discre...

David Wingate, Satinder P. Singh
