Search Sciweavers | Sciweavers

1760 search results - page 34 / 352

» Learning from Partial Observations

FCS
2006

103 views, Computer Science

From Sequential Processes to Grid Computation

15 years 8 months ago

Download ww1.ucmss.com

We introduce an extended model for view-centric reasoning, EVCR, that provides more comprehensive and flexible abstractions for representing actual concurrency. The theory of Communicati...

Mark Burgin, Marc Smith

IROS
2009
IEEE

206 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with Gaussian processes

16 years 1 month ago

Download www.cs.cmu.edu

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

ICRA
2009
IEEE

161 views, Robotics

Learning and generalization of motor skills by learning from demonstration

16 years 1 month ago

Download www-clmc.usc.edu

We provide a general approach for learning robotic motor skills from human demonstration. To represent an observed movement, a non-linear differential equation is learned such ...

Peter Pastor, Heiko Hoffmann, Tamim Asfour, Stefan...

ALT
2005
Springer

137 views, Machine Learning

Defensive Universal Learning with Experts

16 years 3 months ago

Download www.idsia.ch

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...

Jan Poland, Marcus Hutter

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
