Search Sciweavers | Sciweavers

36 search results - page 7 / 8

» Posterior Weighted Reinforcement Learning with State Uncerta...

174

click to vote

IROS
2008
IEEE

144views Robotics» more IROS 2008»

Learning nonparametric policies by imitation

16 years 14 days ago

Download www.cs.washington.edu

— A long cherished goal in artiﬁcial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ...

David B. Grimes, Rajesh P. N. Rao

claim paper

Read More »

130

click to vote

JETAI
2002

69views more JETAI 2002»

The interaction of representations and planning objectives for decision-theoretic planning tasks

15 years 5 months ago

Download idm-lab.org

We study decision-theoretic planning or reinforcement learning in the presence of traps such as steep slopes for outdoor robots or staircases for indoor robots. In this case, achi...

Sven Koenig, Yaxin Liu

claim paper

Read More »

185

click to vote

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

16 years 16 days ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

194

click to vote

NEUROSCIENCE
2001
Springer

260views Applied Computing» more NEUROSCIENCE 2001»

Role of the Cerebellum in Time-Critical Goal-Oriented Behaviour: Anatomical Basis and Control Principle

15 years 10 months ago

Download www.tech.plym.ac.uk

The Brain is a slow computer yet humans can skillfully play games such as tennis where very fast reactions are required. Of particular interest is the evidence for strategic thinki...

Guido Bugmann

claim paper

Read More »

160

click to vote

NIPS
1998

102views Information Technology» more NIPS 1998»

An Entropic Estimator for Structure Discovery

15 years 7 months ago

Download www.merl.com

We introduce a novel framework for simultaneous structure and parameter learning in hidden-variable conditional probability models, based on an entropic prior and a solution for i...

Matthew Brand

claim paper

Read More »

« Prev « First page 7 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers